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3. Where required by any of the elected Offices, the International Bureau will prepare an English translation of the 
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translations and paying national fees) within 30 months from the priority date (or later in some Offices) (Article 
39(1)) (see also the reminder sent by the International Bureau with Form PCT/IB/301 ). 
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International application No. PCT/US99/26788 



I. Basis of the report 

1 . This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in 
response to an invitation under Article 14 are referred to in this report as "originally filed" and are not annexed to 
the report since they do not contain amendments (Rules 70. 16 and 70.17).): 
Description, pages: 

1 -64 as originally filed 



Sequence listing part of the description, pages: 

1-19, as originally fifed 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

E9 contained in the international application in written form. 

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

□ The statement that the. information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 



Claims, No.: 



1-32. 



as originally filed 



Drawings, sheets: 



1/23-23/23 



as originally filed 
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□ the description, 

□ the claims, 

□ the drawings, 



pages: 
Nos.: 



sheets: 



5. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

6. Additional observations, if necessary: 

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

1 . The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non- 
obvious), or to be industrially applicable have not been examined in respect of: 

□ the entire international application. 

R claims Nos. 1-3, 5, 13, 14, 18-28, 30-32 (completely), 4, 6-12, 15-17, 29 (partially). 



□ the said international application, or the said claims Nos. relate to the following subject matter which does 
not require an international preliminary examination (specify): 



□ the description, claims or drawings (indicate particular elements beloW) or said claims Nos. are so unclear 
that no meaningful opinion could be formed (specif/): 



IS the claims, or said claims Nos. 1-3, 5, 13, 14, 18-28, 30-32 (completely), 9-12, 15-17 (partially) are so 
inadequately supported by the description that no meaningful opinion could be formed. 

E3 no international search report has been established for the said claims Nos. 1-32 (partially). 

2. A meaningful international preliminary examination report cannot be carried out due to the failure of the nucleotide 
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative 
Instructions: 

□ the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard. 

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



because: 
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1. Statement 



Novelty (N) 



Yes: 
No: 



Claims 
Claims 



4, 6-8, 29 
9-12, 15-17 



Inventive step (IS) 



Yes: 
No: 



Claims 
Claims 



4, 6-12, 15-17, 29 



Industrial applicability (IA) 



Yes: 
No: 



Claims 
Claims 



4, 6-12, 15-17, 29 



2. Citations and explanations 
see separate sheet 

VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: r _ 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 
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III. Non-establishment of opinion with regard to novelty, inventive step and 
industrial applicability 

1 . As a result of an objection of lack of unity raised by the International Search 
Authority, the international search report has been restricted to the invention 
referred to as invention 1 (claims 1-32, partially), namely: 

a polypeptide IMX4 represented by amino acid sequence ID No. 27 and DNA 
molecules represented by nucleic acid sequences ID No. 1 and ID No. 11, 12 and 
13, antibodies binding to said polypeptide, host cells expressing said polypeptide, 
variants thereof, and therapeutic uses thereof. 

Consequently, an opinion with regard to novelty, inventive step and industrial 
applicability can only be partially formulated, and limited to the above mentioned 
invention 1. 

2. An opinion with regard to novelty, inventive step and industrial applicability cannot 
be formulated for the claims relating to a polypeptide comprising the amino acid 
sequence of SEQ ID No. 27, or a polypeptide encoded by a polynucleotide of SEQ 
ID No. 1 , for the following reasons: 

According to the present application, the polypeptide having the amino acid 
sequence of SEQ ID No. 27 is the translational product of the nucleotide 
sequence of isolated DNA molecule IMX4 of SEQ ID No. 1 (see e.g. page 59, 
example 2). However, in view of the nucleotide sequence of SEQ ID No. 1 , it 
appears that said DNA molecule cannot encode said polypeptide: 

A. The nucleotide sequence of SEQ ID No. 1 contains only one putative 
methionine initiation codon (ATG) at position 29. The corresponding putative open 
reading frame (ORF) stops after three amino acids (KAV) with a stop codon (TAG) 
at position 41 . This is too short to corresponds to an ORF leading to a translated 
product. Thus, there are at the nucleotide level no indications that the nucleotide 
sequence of SEQ ID No. 1 encodes for a polypeptide. 

B. The comparison of the amino acid sequence of SEQ ID No. 27 with the 
nucleotide sequence of SEQ ID No. 1 shows that the codon supposed to 
correspond to the methionine of SEQ ID No. 27 actually corresponds to a stop 
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codon (TAG). Thus, the polypeptide of SEQ ID No. 27 cannot be translated from 
IMX4 as asserted in the present application (see e.g. page 59, line 15). 

C. The alignment of the nucleotide sequence of SEQ ID No. 1 with the 
nucleotide sequence of the soares ovary tumour HbHOT Homo sapiens cDNA 
clone (D1) shows that the nucleotide sequence of SEQ ID No. 1 corresponds to a 
3 1 untranslated region. In fact, the nucleotide sequence of SEQ ID No. 1 shows 
over 96% identity with the 3* untranslated region of said human cDNA clone. This 
alignment shows that the last five A nucleotides of SEQ ID No. 1 are part of the 
polyA tail. A polyadenylation signal motif can also be found at position 1 14. 
Moreover, because of the high mutation rate of non-coding sequences, 3' 
untranslated regions are gene specific. Thus, it can be totally excluded that a 3 1 ^ 
untranslated region would show such a high identity with a coding region. 

Therefore, the nucleotide sequence of SEQ ID No. 1 , which is supposed to 

encode the polypeptide having the amino acid sequence of SEQ ID No. 27, actually 

corresponds to a 3' untranslated region. 

Since the polypeptide of SEQ ID No. 27 does not exist, and since the nucleotide 
sequence of SEQ ID No. 1 does not encode any polypeptide, an opinion with 
regard to novelty, inventive step and industrial applicability on the claims referring 
to said polypeptides, and related claims cannot be formulated (claims 1-3, 5, 13, 
14, 18-28 and 30-32, completely; 9-12 and 15-17, partially). 



V. Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive 
step or industrial applicability; citations and explanations supporting such 
statement 

1 . Reference is made to the following documents: 

D1 : EMBL Database entry HS1 272990 

Accession Number AA480830 
D2: EMBL Database entry HS6802 

Accession Number Z82215.1 
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2. D1 describes the nucleotide sequence of a cDNA clone comprising a sequence 
showing over 96% identity with SEQ ID No. 1 . 

D2 describes a nucleotide sequence showing over 97% identity with the 
nucleotide sequence of SEQ ID No. 1 . 

Thus, in view of D1 or D2, the subject-matter of claims 9-12 and 15-17 is not 
novel. Therefore, claims 9-12 and 15-17 do not meet the requirements of Article 
33(2) PCT. 

3. The nucleotide sequence of SEQ ID No. 1 described in the present application 
corresponds to the 3' untranslated region of an unknown gene. The present 
application shows that the transcription of said gene is increased when cells are 
treated with interferon-y (example 1). However, this does not give any evidence on 
the specific function of the product of said gene. The function of the product of the 
gene corresponding to the 3' untranslated region of nucleotide sequence of SEQ 
ID No.1 being unknown, the isolation of said nucleotide does not solve any 
problem. Since there is no problem to be solved, there cannot be any inventive 
step involved. Thus, the subject-matter of claims 4, 6-12, 15-17 and 29 is not 
inventive. Therefore, claims 4, 6-12, 15-17 and 29 do not meet the requirements 
of Article 33(3) PCT. 

VII. Certain defects in the international application 

1 . Point (d) of claim 9 is lacking. 

2. The specification mentions that the sequences of SEQ ID No. 1 1 and 12 include 

the sequence of SEQ ID No. 1 (page 59, lines 7-10). It appears that this is not the 

♦ 

case. 

3. The specification does not define the SEQ ID No. 18, 19 and 21-26 although they 
are mentioned in the claims. 

4. Claim 16 refers to a method of making a recombinant host cell comprising the 
isolated nucleic acid molecule of claim 9. Claim 29 refers to a method of 
diagnosing irritable bowel disease or a susceptibility to irritable bowel disease in a 
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subject. Both claims do not mention how said methods are to be performed. 
VIII. Certain observations on the international application 

1. The formulation "...comprises..." in claims 4 and 8-12 does not clearly define the 
scope of the claim. Thus, this expression should be replaced with "consists of" 
(Article 6 PCT). 

2. Claims 9 and 1 2 lack clarity due to the formulation "hybridisable to". This 
formulation is vague and imprecise, it does not provide any technical information 
about the hybridisation conditions, and as a consequence is not adapted to clearly 
define the scope of the claim. In fact, any nucleic acid molecule may hybridise to " 
the nucleic acid molecule of SEQ ID No. 1, when the hybridisation conditions are 
adapted to their degree of identity (Article 6 PCT). 

» 

3. Claim 1 mentions that the DNA molecule of SEQ ID No. 1 has altered expression 
in a T84 model of gut barrier function. However, the specification does not provide 
any evidence for said altered expression. The present application only shows that 
treatment of the T84 cells with interferon-? induces expression of the DNA 
molecule of SEQ ID No. 1. Thus, claim 1 is not based on the description (Article 6- 
support PCT). 

4. Claim 29 refers to a method of diagnosing irritable bowel disease or a 
susceptibility to irritable bowel disease in a subject by determining the presence or 
absence of a polynucleotide. Apart from vague declarations, the present 
application does not provide any evidence showing that the irritable bowel disease 
ma y be evidenced by determining the presence or absence of the polynucleotide 
of SEQ ID No. 1 . Thus, claim 29 is not based on the description (Article 6-support 
PCT). Moreover, since a polynucleotide is always either present or absent, the 
man skilled in the art, faced with the problem of establishing such a diagnosis 
would not be able to perform it with said method (Article 5 PCT). 
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Applicant's or agent's file reference 
98,664-A , 


See Notification of Transmittal of Internationa] 
FOR FURTHER ACTION Preliminary Examination Report (Form PCT/IPEA/416) 


International application No. 
PCT/US99/26788 


International filing date (day/month/year) 
10/11/1999 


Priority date (day/month/year) 
10/11/1998 


International Patent Classification (IPC) or national classification and IPC 
C12N15/12 


Applicant 

DIGITAL GENE TECHNOLOGIES.INC ET AL 



1. This international preliminary examination report has been prepared by this International Preliminary Examining Authority 
and is transmitted to the applicant according to Article 36. 



2. This REPORT consists of a total of 8 sheets, including this cover sheet. 

□ This report is also accompanied by ANNEXES, i.e. sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of sheets. 



3. This report contains indications relating to the following items: 

™ Basis of the report 
Priority 

Non-estabtishment of opinion with regard to novelty, inventive step and industrial applicability 
Lack of unity of invention 

Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations suporting such statement 

Certain documents cited 

Certain defects in the international application 

Certain observations on the international application 



I 




II 


□ 


III 

IV 
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VI 
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VI! 




VIII 





Date of submission of the demand 
06/06/2000 


Date of completion of this report 
08.03.2001 


Name and mailing address of the international 
preliminary examining authority: 

^ European Patent Office 
/fljl) D-80298 Munich 
Z^r* Tel. +49 89 2399 - 0 Tx: 523656 epmu d 
Fax: +49 89 2399 - 4465 


Authorized officer ^^S^v 

Chavanne, F jy 

Telephone No. +49 89 2399 8399 ^ 
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I. Basis of the report 

1 . This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in 
response to an invitation under Articie 14 are referred to in this report as "originally filed" and are not annexed to 
the report since they do not contain amendments (Rules 70. 16 and 70. 1 7).)\ 

Description, pages: 

1 -64 as originally filed 

Claims, No.: 

1 -32 as originally filed 

Drawings, sheets: 

1/23-23/23 as originally filed 

Sequence listing part of the description, pages: 

1-19, as originally filed 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

H contained in the international application in written form. 

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 
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□ the description, 

□ the claims, 

□ the drawings, 



pages: 



sheets: 



Nos.: 



5. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

6. Additional observations, if necessary: 

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

1 . The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non- 
obvious), or to be industrially applicable have not been examined in respect of: 

□ the entire international application. 

claims Nos. 1-3, 5, 13, 14, 18-28, 30-32 (completely), 4, 6-12, 15-17, 29 (partially). 



□ the said international application, or the said claims Nos. relate to the following subject matter which does 
not require an international preliminary examination (specify): 



□ the description, claims or drawings (indicate particular elements below) or said claims Nos. are so unclear 
that no meaningful opinion could be formed (specify): 



S the claims, or said claims Nos. 1-3, 5, 13, 14, 18-28, 30-32 (completely), 9-12, 15-17 (partially) are so 
inadequately supported by the description that no meaningful opinion could be formed. 

K no international search report has been established for the said claims Nos. 1-32 (partially). 

2. A meaningful international preliminary examination report cannot be carried out due to the failure of the nucleotide 
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative 
Instructions: 

□ the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard. 

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 
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1. Statement 



Novelty (N) 


Yes: 


Claims 


4, 6-8, 29 




No: 


Claims 


9-12, 15-17 


Inventive step (IS) 


Yes: 


Claims 






No: 


Claims 


4, 6-12, 15-17, 29 


Industrial applicability (IA) 


Yes: 


Claims 


4, 6-12, 15-17, 29 




No: 


Claims 





2. Citations and explanations 
see separate sheet 

VII. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 
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III. Non-establishment of opinion with regard to novelty, inventive step and 
industrial applicability 

1 . As a result of an objection of lack of unity raised by the International Search 
Authority, the international search report has been restricted to the invention 
referred to as invention 1 (claims 1-32, partially), namely: 

a polypeptide IMX4 represented by amino acid sequence ID No. 27 and DNA 
molecules represented by nucleic acid sequences ID No. 1 and ID No. 11,12 and 
13, antibodies binding to said polypeptide, host cells expressing said polypeptide, 
variants thereof, and therapeutic uses thereof. 

Consequently, an opinion with regard to novelty, inventive step and industrial 
applicability can only be partially formulated, and limited to the above mentioned 
invention 1 . 

2. An opinion with regard to novelty, inventive step and industrial applicability cannot 
be formulated for the claims relating to a polypeptide comprising the amino acid 
sequence of SEQ ID No. 27, or a polypeptide encoded by a polynucleotide of SEQ 
ID No. 1 , for the following reasons: 

According to the present application, the polypeptide having the amino acid 
sequence of SEQ ID No. 27 is the translational product of the nucleotide 
sequence of isolated DNA molecule IMX4 of SEQ ID No. 1 (see e.g. page 59, 
example 2). However, in view of the nucleotide sequence of SEQ ID No. 1 , it 
appears that said DNA molecule cannot encode said polypeptide: 

A. The nucleotide sequence of SEQ ID No. 1 contains only one putative 
methionine initiation codon (ATG) at position 29. The corresponding putative open 
reading frame (ORF) stops after three amino acids (KAV) with a stop codon (TAG) 
at position 41 . This is too short to corresponds to an ORF leading to a translated 
product. Thus, there are at the nucleotide level no indications that the nucleotide 
sequence of SEQ ID No. 1 encodes for a polypeptide. 

B. The comparison of the amino acid sequence of SEQ ID No. 27 with the 
nucleotide sequence of SEQ ID No. 1 shows that the codon supposed to 
correspond to the methionine of SEQ ID No. 27 actually corresponds to a stop 
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codon (TAG). Thus, the polypeptide of SEQ ID No. 27 cannot be translated from 
IMX4 as asserted in the present application (see e.g. page 59, line 15). 

C. The alignment of the nucleotide sequence of SEQ ID No. 1 with the 
nucleotide sequence of the soares ovary tumour HbHOT Homo sapiens cDNA 
clone (D1) shows that the nucleotide sequence of SEQ ID No. 1 corresponds to a 
3' untranslated region. In fact, the nucleotide sequence of SEQ ID No. 1 shows 
over 96% identity with the 3' untranslated region of said human cDNA clone. This 
alignment shows that the last five A nucleotides of SEQ ID No. 1 are part of the 
polyA tail. A polyadenylation signal motif can also be found at position 114. 
Moreover, because of the high mutation rate of non-coding sequences, 3' 
untranslated regions are gene specific. Thus, it can be totally excluded that a 3' 
untranslated region would show such a high identity with a coding region. 

Therefore, the nucleotide sequence of SEQ ID No. 1 , which is supposed to 

encode the polypeptide having the amino acid sequence of SEQ ID No. 27, actually 

corresponds to a 3' untranslated region. 

Since the polypeptide of SEQ ID No. 27 does not exist, and since the nucleotide 
sequence of SEQ ID No. 1 does not encode any polypeptide, an opinion with 
regard to novelty, inventive step and industrial applicability on the claims referring 
to said polypeptides, and related claims cannot be formulated (claims 1-3, 5, 13, 
14, 18-28 and 30-32, completely; 9-12 and 15-17, partially). 

V. Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive 
step or industrial applicability; citations and explanations supporting such 
statement 

1 . Reference is made to the following documents: 

D1 : EMBL Database entry HS1 272990 

Accession Number AA480830 
D2: EMBL Database entry HS6802 

Accession Number Z8221 5.1 



Form PCT/Separate Sheet/409 (Sheet 2) (EPO-April 1997) 



INTERNATIONAL PRELIMINARY International application No. PCT/US99/26788 

EXAMINATION REPORT - SEPARATE SHEET 



2. D1 describes the nucleotide sequence of a cDNA clone comprising a sequence 
showing over 96% identity with SEQ ID No. 1 . 

D2 describes a nucleotide sequence showing over 97% identity with the 
nucleotide sequence of SEQ ID No. 1. 

Thus, in view of D1 or D2, the subject-matter of claims 9-12 and 15-17 is not 
novel. Therefore, claims 9-12 and 15-17 do not meet the requirements of Article 
33(2) PCT. 

3. The nucleotide sequence of SEQ ID No. 1 described in the present application 
corresponds to the 3* untranslated region of an unknown gene. The present 
application shows that the transcription of said gene is increased when cells are 
treated with interferon-y (example 1). However, this does not give any evidence on 
the specific function of the product of said gene. The function of the product of the 
gene corresponding to the 3' untranslated region of nucleotide sequence of SEQ 
ID No.1 being unknown, the isolation of said nucleotide does not solve any 
problem. Since there is no problem to be solved, there cannot be any inventive 
step involved. Thus, the subject-matter of claims 4, 6-12, 15-17 and 29 is not 
inventive. Therefore, claims 4, 6-12, 15-17 and 29 do not meet the requirements 
of Article 33(3) PCT. 

VII. Certain defects in the international application 

1 . Point (d) of claim 9 is lacking. 

2. The specification mentions that the sequences of SEQ ID No. 1 1 and 12 include 
the sequence of SEQ ID No. 1 (page 59, lines 7-10). It appears that this is not the 
case. 

3. The specification does not define the SEQ ID No. 18, 19 and 21-26 although they 
are mentioned in the claims. 

4. Claim 16 refers to a method of making a recombinant host cell comprising the 
isolated nucleic acid molecule of claim 9. Claim 29 refers to a method of 
diagnosing irritable bowel disease or a susceptibility to irritable bowel disease in a 
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subject. Both claims do not mention how said methods are to be performed. 
VIII. Certain observations on the international application 

1 . The formulation "...comprises..." in claims 4 and 8-12 does not clearly define the 
scope of the claim. Thus, this expression should be replaced with "consists of" 
(Article 6 PCT). 

2. Claims 9 and 12 lack clarity due to the formulation "hybridisable to". This 
formulation is vague and imprecise, it does not provide any technical information 
about the hybridisation conditions, and as a consequence is not adapted to clearly 
define the scope of the claim. In fact, any nucleic acid molecule may hybridise to 
the nucleic acid molecule of SEQ ID No. 1, when the hybridisation conditions are 
adapted to their degree of identity (Article 6 PCT). 

3. Claim 1 mentions that the DNA molecule of SEQ ID No. 1 has altered expression 
in a T84 model of gut barrier function. However, the specification does not provide 
any evidence for said altered expression. The present application only shows that 
treatment of the T84 cells with interferon-y induces expression of the DNA 
molecule of SEQ ID No. 1 . Thus, claim 1 is not based on the description (Article 6- 
support PCT). 

4. Claim 29 refers to a method of diagnosing irritable bowel disease or a 
susceptibility to irritable bowel disease in a subject by determining the presence or 
absence of a polynucleotide. Apart from vague declarations, the present 
application does not provide any evidence showing that the irritable bowel disease 
may be evidenced by determining the presence or absence of the polynucleotide 
of SEQ ID No. 1 . Thus, claim 29 is not based on the description (Article 6-support 
PCT). Moreover, since a polynucleotide is always either present or absent, the 
man skilled in the art, faced with the problem of establishing such a diagnosis 
would not be able to perform it with said method (Article 5 PCT). 
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International application No. 

PCT/ US 99/26788 



International filing date (day/month/year) 

10/11/1999 



(Earliest) Priority Date (day/month/year) 

10/11/1998 



Applicant 



DIGITAL GENE TECHNOLOGIES, INC ET AL 



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 1 8. A copy is being transmitted to the International Bureau. 

This Int erna tional Search Report consists of a total of 9 sheets. 

I I It is also accompanied by a copy of each prior art document cited in this report. 



2. 

3. 



Basis of the report 

a. With regard to the language, the international search was carried out on the basis of the intern ationaJ application in the 
language in which it was filed, unless otherwise indicated under this item. 

I I the international search was carried out on the basis of a translation of the international application furnished to this 
Authority (Rule 23.1 (b)). 

b. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international search 
was carried out on the basis of the sequence listing : 

| X | contained in the international application in written form. 

| | filed together with the international application in computer readable form. 

| | furnished subsequently to this Authority in written form. 

PH furnished subsequently to this Authority in computer readble form. 

PH the statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed has been furnished. 

PH the statement that the information recorded in computer readable form is identical to the written sequence listing has been 
furnished 

PH Certain claims were found unsearchable (See Box I). 
PC] Unity of invention is lacking (see Box II). 

With regard to the title, 

| [ the text is approved as submitted by the applicant. 

| X | the text has been established by this Authority to read as follows: 

NUCLEIC ACIDS AND POLYPEPTIDES HAVING ALTERED EXPRESSION IN THE T84 GUT 
BARRIER MODEL 



|"X~| m .f L * ext nas Deen established, according to Rule 38.2(b), by this Authority as it appears in Box ML The applicant may, 



With regard to the abstract, 

| | the text is approved as submitted by the applicant. 

within one month from the date of mailing of this international search report, submit comments to this Authority 

The figure of the drawings to be published with the abstract is Figure No. 

I I as suggested by the applicant. [X] N one of the figures 

| | because the applicant failed to suggest a figure. 

| | because this figure better characterizes the invention. 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. (~X~] Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

see FURTHER INFORMATION sheet PCT/ISA/210 



2. | X I Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

It has to be noted that sequences ID no. 19, no. 22, no. 26, no. 32 and no. 36 do 
not exist in the originally filed sequence listing on which was based the 
search. There are in total 33 sequences and not 38. But the originally 
numbering of the sequences has been kept. 

3. | | Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
' ' searchable claims 



2. | | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, thi 
' ' covers only those claims for which fees were paid, specifically claims Nos.: 



this International Search Report 



4. | y [ No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

1-32 all partially 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

| j No protest accompanied the payment of additional search fees. 
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Continuation of Box 1.1 

Although claim 29 as far as it concerns an in vivo method is directed to 
a diagnostic method practised on the human/animal body (rule 39.1 IV 
PCT), the search has been carried out and based on the alleged effects of 
the compound/composition. 

Although claims 27 and 28 are directed to a method of treatment of the 
human/animal body (rule 39.1 IV PCT), the search has been carried out and 
based on the alleged effects of the compound/composition. 

Continuation of Box 1.2 



It has to be noted that sequences ID no. 19, no. 22, no. 26, no. 32 and 
no. 36 do not exist in the originally filed sequence listing on which was 
based the search. There are in total 33 sequences and not 38. But the 
originally numbering of the sequences has been kept. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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1. Claims: 1-32 all partially 

A polypeptide IMX4 represented by amino acid sequence ID 
no. 27 and DNA molecules represented by nucleic acid 
sequences ID no.l and ID no. 11,12 and 13- Host cell. 
Antibody. Variants thereof. Therapeutic uses. 



2. Claims: 1-32 all partially 

A polypeptide IMX10 represented by amino acid sequence ID 
no. 28 and DNA molecules represented by nucleic acid 
sequences ID no. 2 and ID no. 14. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



3. Claims: 1-32 all partially 

A polypeptide IMX21 represented by amino acid sequence ID 
no. 29 and DNA molecules represented by nucleic acid 
sequences ID no. 3 and ID no. 15. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



4. Claims: 1-32 all partially 

A polypeptide IMX28 represented by amino acid sequence ID 
no. 30 and DNA molecules represented by nucleic acid 
sequences ID no. 4 and no. 16. Host cell. Antibody. Variants 
thereof. Therapeutic uses. 



5. Claims: 1-32 all partially 

A polypeptide IMX32 represented by amino acid sequence ID 
no. 31 and DNA molecules represented by nucleic acid 
sequences ID no. 5, no. 17 and no. 18. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



6. Claims: 1-32 all partially 

A polypeptide IMX39 represented by amino acid sequence ID 
no. 33 and DNA molecules represented by nucleic acid 
sequences ID no. 20 and no. 6. Host cell. Antibody. Variants 
thereof. Therapeutic uses. 



7. Claims: 1-32 all partially 

A polypeptide IMX40 represented by amino acid sequence ID 
no. 34 and DNA molecules represented by nucleic acid 
sequences ID no.7 and ID no. 21. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 
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8. Claims: 1-32 all partially 



A polypeptide IMX42 represented by amino acid sequence ID 
no. 35 and DNA molecules represented by nucleic acid 
sequences ID no. 8. Host cell. Antibody. Variants thereof. 
Therapeutic uses. 



9. Claims: 1-32 all partially 

A polypeptide IMX44 represented by amino acid sequence ID 
no. 37 and DNA molecules represented by nucleic acid 
sequences ID no. 9, no. 23 and no. 24. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



10. Claims: 1-32 all partially 

A polypeptide IMX56 represented by amino acid sequence ID 
no. 38 and DNA molecules represented by nucleic acid 
sequences ID no. 10 and no. 25. Host cell. Antibody. Variants 
thereof. Therapeutic uses. 
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I^^US 99/26788 



The invention is directed to purified and isolated polypeptides, the nucleic 
acids encoding such polypeptides that have altered expression characteristics 
in the T84 gut barrier model, processes for production of recombinant forms of 
such polypeptides, antobodies generated against these polypeptides, fragmented 
peptides derived from these polypeptides and the uses to study inflammatory 
bowel diseas (IBD). y 
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2, | | As a0 searchable claims could be searched without effort justifying an aoVfitionaJ fee, this Authority did not invite payment 

of any additional fee. 



3. I j As only some of the required addftkana) search fees were timely paid by the applicant, this International Search Report 
I 1 covers only those claims for which fees were paid, specifically claims Nos.: 



| v | No required additional search fees were timely paid by the applicant Consequently, this International Search Report is 



restricted to the invention first mentioned in the claims; it is covered by claims Nos. 

1-32 all partially 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search fees. 
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International Application No. PCT/US 99/26788 



FURTHER INFORMATION CONTINUED FROM PCT/iSA/ 210 



Continuation of Box 1.1 

Although claim 29 as far as it concerns an in vivo method is directed to 
a diagnostic method practised on the human/animal body (rule 39.1 IV 
PCT), the search has been carried out and based on the alleged effects of 
the compound/ con?)o si tion. 

Although claims 27 and 28 are directed to a method of treatment of the 
human/animal body (rule 39-1 IV PCT), the search has been carried out and 
based on the alleged effects of the compound/compos i tion. 

Continuation of Box 1.2 

It has to be noted that sequences ID no .19, no. 22, no. 26, no. 32 and 
no. 36 do not exist in the originally filed sequence listing on which was 
based the search. There are in total 33 sequences and not 38. But the 
originally numbering of the sequences has been kept. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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1. Claims: 1-32 all partially 

A polypeptide IMX4 represented by amino acid sequence ID 
no. 27 and DNA molecules represented by nucleic acid 
sequences ID no.l and ID no. 11, 12 and 13. Host cell. 
Antibody. Variants thereof. Therapeutic uses. 



2. Claims: 1-32 all partially 

A polypeptide I MX 10 represented by amino acid sequence ID 
no. 28 and DNA molecules represented by nucleic acid 
sequences ID no. 2 and ID no. 14. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



3. Claims: 1-32 all partially 

A polypeptide IMX21 represented by amino acid sequence ID 
no. 29 and DNA molecules represented by nucleic acid 
sequences ID no. 3 and ID no. 15. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



4. Claims: 1-32 all partially 

A polypeptide INX28 represented by amino acid sequence ID 
no. 30 and DNA molecules represented by nucleic acid 
sequences ID no. 4 and no. 16. Host cell. Antibody. Variants 
thereof. Therapeutic uses. 



5. Claims: 1-32 all partially 

A polypeptide IMX32 represented by amino acid sequence ID 
no. 31 and DNA molecules represented by nucleic acid 
sequences ID no. 5, no. 17 and no. 18. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



6. Claims: 1-32 all partially 

A polypeptide IMX39 represented by amino acid sequence ID 
no. 33 and DNA molecules represented by nucleic acid 
sequences ID no. 28 and no. 6. Host cell. Antibody. Variants 
thereof. Therapeutic uses. 



7. Claims: 1-32 all partially 

A polypeptide IMX40 represented by amino acid sequence ID 
no. 34 and DNA molecules represented by nucjeic acid 
sequences ID no. 7 and ID no. 21. Host eel 1 . Antibody. 
Variants thereof. Therapeutic uses. 
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8. Claims: 1-32 all partially 



A polypeptide IMX42 represented by amino acid sequence ID 
no. 35 and DNA molecules represented by nucleic acid 
sequences ID no. 8. Host cell. Antibody. Variants thereof. 
Therapeutic uses. 



9. Claims: 1-32 all partially 

A polypeptide IMX44 represented by amino acid sequence ID 
no. 37 and DNA molecules represented by nucleic acid 
sequences ID no. 9, no. 23 and no. 24. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



10. Claims: 1-32 all partially 

A polypeptide IMX56 represented by amino acid sequence ID 
no. 38 and DNA molecules represented by nucleic acid 
sequences ID no. 19 and no. 25. Host cell. Antibody. Variants 
thereof. Therapeutic uses. 



KENT COOPERATION TREATY 




From the INTERNATIONAL SEARCHING AUTHORITY 



To: 

McDonnell boehnen hulbert 
& berghoff 

Attn. Zimmerman, Roger P. 

300 South Wacker Drive, Suite 3200 

CHICAGO, IL 60606 

UNITED STATES OF AMERICA 



PCT 




NOTIFICATION OF TRANSMITTAL OF 
THE INTERNATIONAL SEARCH REPORT 
OR THE DECLARATION 

(PCT Rule 44.1) 



Date of mailing 
(day/month/year) 



16/08/2000 



Applicant's or agents file reference 

98,664-A 



FOR FURTHER ACTION 



See paragraphs 1 and 4 below 



International application No. 

PCT/ US 99/26788 



International filing date 
(day/month/year) 



10/11/1999 



Applicant 

DIGITAL GENE TECHNOLOGIES, INC ET AL 



1 . The applicant is hereby notified that the International Search Report has been established and is transmitted herewith. 

Filing of amendments and statement under Article 19: 

The applicant is entitled, if he so wishes, to amend the claims of the International Application (see Rule 46): 

When? The time limit for filing such amendments is normally 2 months from the date of transmittal of the 
International Search Report; however, for more details, see the notes on the accompanying sheet. 



Where? Directly to the 



International Bureau of WIPO 
34, chemin des Colombettes 
1 21 1 Geneva 20, Switzerland 
Faacimile No.: (41-22) 740.14.35 



For more detailed instructions, see the notes on the accompanying sheet. 



□ The applicant is hereby notified that no International Search Report will be established and that the declaration under 
Article 17(2)(a) to that effect is transmitted herewith. 



| [ With regard to the protest against payment of (an) adcStional fee(s) under Rule 40.2, the applicant is notified that: 

□ the protest together with the decision thereon has been transmitted to the International Bureau together with the 
applicant's request to forward the texts of both the protest and the decision thereon to the designated Offices. 

| | no decision has been made yet on the protest; the applicant will be notified as soon as a decision is made. 



4. Further action(s): The applicant is reminded of the following: 

Shortly after 18 months from the priority date, the international application will be published by the International Bureau. 
If the applicant wishes to avoid or postpone publication, a notice of withdrawal of the international application, or of the 
priority claim, must reach the International Bureau as provided in Rules 90b/s.1 and 90bis.3, respectively, before the 
completion of the technical preparations for international publication. 

Within 19 months from the priority date, a demand for international preliminary examination must be filed if the applicant 
wishes to postpone the entry into the national phase until 30 months from the priority date On some Offices even later). 

Within 20 months from the priority date, the applicant must perform the prescribed acts for entry into the national phase 
before all designated Offices which have not been elected in the demand or in a later election within 1 9 months from the 
priority date or could not be elected because they are not bound by Chapter II. 




Name and mailing address of the International Searching Authority 

European Patent Office, P.B. 5818 Patentlaan 2 
NL-2280 HV Rijswijk 

Tel. (+31 -70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 




Authorized officer 



Catherine Humbert 



Form PCT/ISA/220 (July 1998) 



NOTE^Bb FORM PCT/ISA/220 



These Notes are intended to give the basic instructions concerning the filing of amendments under article 19. The 
Notes are based on the requirements of the Patent Cooperation Treaty, the Regulations and the Administrative Instructions 
under that Treaty. In case of discrepancy between these Notes and those requirements, the latter are applicable. For more 
detailed information, see also the PCT Applicant's Guide, a publication of WIPO. 

In these Notes, 'Article", "Rule", and "Section* refer to the provisions of the PCT t the PCT Regulations and the PCT 
Administrative Instructions, respectively. 



INSTRUCTIONS CONCERNING AMENDMENTS UNDER ARTICLE 19 



The applicant has, after having received the international search report, one opportunity to amend the claims of the 
international application. It should however be emphasized that, since ail parts of the international application (claims, 
description and drawings) may be amended during the international preliminary examination procedure, there is usually 
no need to file amendments of the claims under Article 19 except where, e.g. the applicant wants the latter to be published 
for the purposes of provisional protection or has another reason for amending the claims before international publication. 
Furthermore, it should be emphasized that provisional protection is available in some States only. 



What parts of the international application may be amended? 

Under Article 1 9, only the claims may be amended. 

During the international phase, the claims may also be amended (or further amended) under Article 34 before 
the International Preliminary Examining Authority. The description and drawings may only be amended under 
Article 34 before the International Examining Authority. 

Upon entry into the national phase, all parts of the international application may be amended under Article 28 
or, where applicable, Article 41 . 



When? Within 2 months from the date of transmittal of the international search report or 1 6 months from the priority 

date, whichever time limit expires later. It should be noted, however, that the amendments will be considered 
as having been received on time if they are received by the International Bureau after the expiration of the 
applicable time limit but before the completion of the technical preparations for international publication 
(Rule 46.1). 



Where not to file the amendments? 

The amendments may only be filed with the International Bureau and not with the receiving Office or the 
International Searching Authority (Rule 46.2). 

Where a demand for international preliminary examination has been/is filed, see below. 



How? Either by cancelling one or more entire claims, by adding one or more new claims or by amending the text of 

one or more of the claims as filed. 

A replacement sheet must be submitted for each sheet of the claims which, on account of an amendment or 
amendments, differs from the sheet originally filed. 

All the claims appearing on a replacement sheet must be numbered in Arabic numerals. Where a claim is 
cancelled, no renumbering of the other claims is required. In all cases where claims are renumbered, they must 
be renumbered consecutively (Administrative Instructions, Section 205(b)). 

The amendments must be made In the language In which the International application Is to be published. 



What documents must/may accompany the amendments? 

Letter (Section 205(b)): 

The amendments must be submitted with a letter. 

The letter will not be published with the international application and the amended claims. It should not be 
confused with the "Statement under Article 19(1)" (see below, under "Statement under Article 19(1)*). 

The letter must be In English or French, at the choice of the applicant However, If the language of the 
International application is English, the letter must be in English; if the language of the International application 
is French, the letter must be in French. 
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The letter must indicate the differences between the claims as filed and the claims as amended. It must, in 
particular, indicate, in connection with each claim appearing in the international application fit being understood 
that identical indications concerning several claims may be grouped),whether 

(i) the claim is unchanged; 

(ii) the claim is cancelled; 
(Hi) the claim is new; 

(iv) the claim replaces one or more claims as filed; 

(v) the claim is the result of the division of a claim as filed. 



The following examples illustrate the manner In which amendments must be explained In the 
accompanying letter: 

1 . [Where originally there were 48 claims and after amendment of some claims there are 51 ]: 
"Claims 1 to 29, 31 , 32, 34, 35, 37 to 48 replaced by amended claims bearing the same numbers; 
claims 30, 33 and 36 unchanged; new claims 49 to 51 added." 

2. [Where originally there were 1 5 claims and after amendment of alt claims there are 11]: 
"Claims 1 to 15 replaced by amended claims 1 to 1 1 .* 

3. [Where originally there were 14 claims and the amendments consist in cancelling some claims and in adding 
new claims]: 

"Claims 1 to 6 and 14 unchanged; claims 7 to 13 cancelled; new claims 15, 16 and 17 added." or 
"Claims 7 to 13 cancelled; new claims 1 5, 16 and 17 added; all other claims unchanged." 

4. {Where various kinds of amendments are made]: 

"Claims 1 -1 0 unchanged; claims 1 1 to 1 3, 1 8 and 1 9 cancelled; claims 1 4, 1 5 and 1 6 replaced by amended 
claim 14; claim 17 subdivided into amended claims 15, 16 and 17; new claims 20 and 21 added." 



"Statement under article 19(1)~ (Rule 46.4) 

The amendments may be accompanied by a statement explaining the amendments and indicating any impact 
that such amendments might have on the description and the drawings (which cannot be amended under 
Article 1 9(1 )). 

The statement will be published with the international application and the amended claims. 
It must be In the language in which the International application is to be published. 

It must be brief, not exceeding 500 words if in English or if translated into English. 

It should not be confused with and does not replace the letter indicating the cfifferences between the claims 
as filed and as amended. It must be filed on a separate sheet and must be identified as such by a heading, 
preferably by using the words "Statement under Article 1 9(1 )." 

It may not contain any disparaging comments on the international search report or the relevance of citations 
contained in that report. Reference to citations, relevant to a given claim, contained in the international search 
report may be made only in connection with an amendment of that claim. 



Consequence if a demand for International preliminary examination has already been filed 

If, at the time of filing any amendments and any accompanying statement, under Article 19, a demand for 
international preliminary examination has already been submitted, the applicant must preferably, at the time of 
filing the amendments (and any statement ) with the International Bureau, also file with the International 
Preliminary Examining Authority a copy of such amendments (and of any statement) and, where required, a 
translation of such amendments for the procedure before that Authority (see Rules 55.3(a) and 62.2, first 
sentence). For further information, see the Notes to the demand form (PCT/l PEA/401). 

Consequence with regard to translation of the international application for entry into the national phase 

The applicants attention is drawn to the fact that, upon entry into the national phase, a translation of the 
claims as amended under Article 19 may have to be furnished to the designated/elected Offices, instead of, or 
in addition to, the translation of the claims as filedf 

For further details on the requirements of each designated/elected Office, see Volume II of the PCT Applicant's 
Guide. 
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PCT 




INTERNATIONAL SEARCH REPORT 

(PCT Article 1 8 and Rules 43 and 44) 



Applicant's or agent's file reference 

98,664-A 


POR FURTHER 368 Notification of Transmittal of International Search Report 

(Form PCT/ISA/220) as well as, where applicable, item 5 below. 

ACTION 


International application No. 

PCT/ US 99/ 26788 


International filing date (day/month/year) 

10/11/1999 


(Earliest) Priority Date (day/month/year) 

10/11/1998 


Applicant 

DIGITAL GENE TECHNOLOGIES, INC ET AL 



This Intern ationai Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
accorcfing to Article 1 8. A copy is being transmitted to the International Bureau. 

This International Search Report consists of a total of 9 sheets. 

| | it is also accompanied by a copy of each prior art document cited in this report. 



Basis of the report 

a. With regard to the language, the international search was carried out on the basis of the international application in the 
language in which it was filed, unless otherwise indicated under this item. 

I I the international search was carried out on the basis of a translation of the international application furnished to this 
1 — 1 Authority (Rule 23.1 (b)). 

b. With regard to any nucleotide and/or amino add sequence disclosed in the international application, the international search 
was carried out on the basis of the sequence listing 



contained in the international application in written form. 

filed together with the international application in computer readable form. 

furnished subsequently to this Authority in written form. 

furnished subsequently to this Authority in computer readbJe form. 

the statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed has been furnished. 

the statement that the information recorded in computer readable form is identical to the written sequence listing has been 
furnished 



[X] Certain claims were found unsearchable (See Box I). 
[T| Unity of Invention Is lacking (see Box II). 

With regard to the title, 

| | the text is approved as submitted by the applicant. 

|X| the text has been established by this Authority to read as follows: 

NUCLEIC ACIDS AND POLYPEPTIDES HAVING ALTERED EXPRESSION IN THE T84 GUT 
BARRIER MODEL 

With regard to the abstract, 

[ | the text is approved as submitted by the applicant. 

m the text has been established, according to Rule 38.2(b), by this Authority as it appears in Box III. The applicant may, 
L^-l within one month from the date of mailing of this international search report, submit comments to this Authority. 

The figure of the drawings to be published with the abstract is Figure No. 

□ as suggested by the applicant. E None of * e fi 9 ure3 
| because the applicant failed to suggest a figure. 

| | because this figure better characterizes the invention. . 
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INTERNATIO 




SEARCH REPORT 



^ktemational application No. 

PCT/US 99/26788 



Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. fx] Claims Nos.: 

— because they relate to subject matter not required to be searched by this Authority, namely: 

see FURTHER INFORMATION sheet PCT/ISA/210 



2. I X | Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 

an extent that no meaningful International Search can be carried out, specifically: 

It has to be noted that sequences ID no. 19, no. 22, no. 26, no. 32 and no. 36 do 
not exist in the originally filed sequence listing on which was based the 
search. There are in total 33 sequences and not 38. But the originally 
numbering of the sequences has been kept. 

3. Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

» _ ■ 

Box II Observations where unity of Invention Is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
' ' searchable claims 



2. | | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
* ' covers only those claims for which fees were paid, specifically claims Nos.: 



4. | Y | No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



1-32 all partially 



Remark on Protest 



[ | The additional search fees were accompanied by the applicant's protest. 
| | No protest accompanied the payment of additional search fees. 
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International Application No. PCT/US 99/26788 



FURTHER INFORMATION CONTINUED FROM PCT/1SA/ 210 



Continuation of Box 1.1 

Although claim 29 as far as it concerns an in vivo method is directed to 
a diagnostic method practised on the human/animal body (rule 39.1 IV 
PCT), the search has been carried out and based on the alleged effects of 
the compound/composition. 

Although claims 27 and 28 are directed to a method of treatment of the 
human/animal body (rule 39.1 IV PCT), the search has been carried out and 
based on the alleged effects of the compound/composition. 

Continuation of Box 1.2 



It has to be noted that sequences ID no. 19, no. 22, no. 26, no. 32 and 
no. 36 do not exist in the originally filed sequence listing on which was 
based the search. There are in total 33 sequences and not 38. But the 
originally numbering of the sequences has been kept. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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1. Claims: 1-32 all partially 



A polypeptide IMX4 represented by amino acid sequence ID 
no. 27 and DNA molecules represented by nucleic acid 
sequences ID no.l and ID no. 11,12 and 13. Host cell. 
Antibody. Variants thereof. Therapeutic uses. 



2. Claims: 1-32 all partially 

A polypeptide IMX10 represented by amino acid sequence ID 
no. 28 and DNA molecules represented by nucleic acid 
sequences ID no. 2 and ID no. 14. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



3. Claims: 1-32 all partially 

A polypeptide IMX21 represented by amino acid sequence ID 
no. 29 and DMA molecules represented by nucleic acid 
sequences ID no. 3 and ID no. 15. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



4. Claims: 1-32 all partially 

A polypeptide IMX28 represented by amino acid sequence ID 
no. 30 and DNA molecules represented by nucleic acid 
sequences ID no. 4 and no. 16. Host cell. Antibody. Variants 
thereof. Therapeutic uses. 



5. Claims: 1-32 all partially 

A polypeptide IMX32 represented by amino acid sequence ID 
no. 31 and DNA molecules represented by nucleic acid 
sequences ID no. 5, no. 17 and no. 18. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



6. Claims: 1-3? all partially 

A polypeptide IMX39 represented by amino acid sequence ID 
no. 33 and DNA molecules represented by nucleic acid 
sequences ID no. 20 and no. 6. Host cell. Antibody. Variants 
thereof. Therapeutic uses. 



7. Claims: 1-32 all partially 

A polypeptide IMX40 represented by amino acid sequence ID 
no. 34 and DNA molecules represented by nucleic acid 
sequences ID no. 7 and ID no. 21. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 
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8. Claims: 1-32 all partially 



A polypeptide IMX42 represented by amino acid sequence ID 
no. 35 and DNA molecules represented by nucleic acid 
sequences ID no. 8. Host cell. Antibody. Variants thereof. 
Therapeutic uses. 



9. Claims: 1-32 all partially 

A polypeptide IMX44 represented by amino acid sequence ID 
no. 37 and DNA molecules represented by nucleic acid 
sequences ID no. 9, no. 23 and no. 24. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



10. Claims: 1-32 all partially 

A polypeptide IMX56 represented by amino acid sequence ID 
no. 38 and DNA molecules represented by nucleic acid 
sequences ID no. 10 and no. 25. Host cell. Antibody. Variants 
thereof. Therapeutic uses. 
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Box III TEXT OF THE ABSTRACT (Continuation of item 5 of the first sheet) 



tional application No. 

US 99/26788 



The invention is directed to purified and isolated polypeptides, the nucleic 
acids encoding such polypeptides that have altered expression characteristics 
in the T84 gut barrier model, processes for production of recombinant forms of 
such polypeptides, antobodies generated against these polypeptides, fragmented 
peptides derived from these polypeptides and the uses to study inflammatory 
bowel diseas (IBD). 
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INTERNATIONAL SEARCH REPORT 




Intentional Apf. .on No 

^Jbs 99/26788 



A. CLASSIFICATION OF SUBJECT MATTER » ^ ™-r„<. A . A , rnni „ ICQ 

IPC 7 C12N15/12 C12Q1/68 A61K38/17 C07K14/47 G01N33/68 
C07K16/18 A61P1/0Q 

According to International Patent Classification (IPC) of to both national classification and IPC 

B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C12N C07K 

Documentation searched other than minimum documentation to the extent that »uoh document* are included in the fields searched 



Etectronio data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * 



Citation of document, with indication, where appropriate, of the relevant passages 

L. HILLIER ET AL: "washU-Merck EST 
project 1997. zx87d07.rl Soares tumor 
NbHOT Homo sapiens cDNA clone IMAGE:810733 
5' , mRNA sequence" 

EMBL DATABASE ENTRY HS1272990, ACCESSION 
NUMBER AA480830.24 June 1997 (1997-06-24), 
XP002136972 
& UNPUBLISHED, 

(97% identity with the entire seq ID no.l) 

-/-- 



j x| Further documents are listed in the oontinuation of box C. 

• Special categories of cited documents : 

"A" document defining the general state-of the art which is not 

considered to be of particular relevance 
'E* earlier document but published on or after the international 

filing date 

"L" document which may throw doubts on priority olaim(s) or 
which a cited to establish the publication date of another 
citation or other special reason (as specified) 

'O* document referring to an oral cfisclosure, use, exhibition or 
other means 

*P* document published prior to the international filing date but 
later than the priority date claimed 

Date of the actual completion of the international search 



4 May 2000 

Name and mailing address of the ISA 

European Patent Office, P.B. 581 8 Patentlaan 2 
NL • 2280 HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 

Form PCT/lSA/210 (second sheet) (Juh/ 1992) 



Relevant to claim No. 

4,8,9, 
12,15-17 



□ 



Patent family members are listed in annex. 



T* later document published after the international filing date 
or priority date and not in conflict with the application but 
orted to understand the principle or theory underlying the 
invention 

•X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

•Y* document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art 

* V document member of the same patent family 

Date of mailing of the international search, report 



1 & oa zooo 



Authorized officer 



LE C0RNEC N.D.R. 
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C(Continuation) DOCUMENTS C0NSIDEHlP?O BE RELEVANT 



IntemationaJ Apf on No 

jj^)S_99/26788 



Category 0 I Citation of document, with indication, where appropriate, of the relevant passages 



P.N. DUCHATEAU ET AL: "Homo sapiens 

apol ipoprotein L mRNA, complete cds" 

EMBL DATABASE ENTRY AF019225, ACCESSION 

NUMBER AF019225, 

29 September 1997 (1997-09-29), 

XP002136973 

cited in the application 

-& P.N. DUCHATEAU ET AL: "Apol ipoprotein 

L, a new human high density lipoprotein 

apol ipoprotein expressed by the pancreas. 

Identification, cloning, characterization, 

and plasma distribution" 

JOURNAL OF BIOLOGICAL CHEMISTRY., 

vol. 272, no. 41, 1997, pages 25576-25582, 

XP002136974 

AMERICAN SOCIETY OF BIOLOGICAL CHEMISTS, 
BALTIMORE, MD., US 
ISSN: 0021-9258 

D. LLOYD: "Human DNA sequence from clone 
RP1-6802 on chromosome 22 contains the 5' 
end of the AP0L2 gene for apol ipoprotein 
L2, the APOLgene for apol ipoprotein L, the 
MYH9 for nonmuscletype myosin heavy chain 
9. ESTs, STSs and GSSs" 
EMBL DATABASE ENTRY HS6802, ACCESSION 
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f^(j the actual date of receipt of the demand by this Authority (Rule 61.1(b)). 

| [ the actual date of receipt of the demand on behalf of this Authority (Rule 59.3(e)). 

| | the date on which this Authority has, in response to the invitation to correct defects in the demand 
(Form PCT/I PEA/404), received the required corrections. 

| | ATTENTION: That date of receipt is AFTER the expiration of 19 months from the priority date. Consequently, the 
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1 . This written opinion is the first drawn up by this International Preliminary Examining Authority. 

2. This opinion contains indications relating to the following items: 

Basis of the opinion 
Priority 

Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 
Lack of unity of invention 

Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

Certain document cited 

Certain defects in the international application 

Certain observations on the international application 

The applicant is hereby invited to reply to this opinion. 
When? 
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II 
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III 




IV 


□ 


V 




VI 
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VII 




VIII 





How? 



Also: 



See the time limit indicated above. The applicant may, before the expiration of that time limit 
request this Authority to grant an extension, see Rule 66.2(d). 

By submitting a written reply, accompanied, where appropriate, by amendments, according to Rule 66.3. 
For the form and the language of the amendments, see Rules 66.8 and 66.9. 

For an additional opportunity to submit amendments, see Rule 66.4. 

For the examiner's obligation to consider amendments and/or arguments, see Rule 66.4 bis. 

For an informal communication with the examiner, see Rule 66.6. 
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I. Basis of the opinion 

1 . This opinion has been drawn on the basis of {substitute sheets which have been furnished to the receiving Office 
in response to an invitation under Article 14 are referred to in this opinion as "originally filed" .): 

Description, pages: 

1 -64 as originally filed 

Claims, No.: 

1 -32 as originally filed 

Drawings, sheets: 

1/23-23/23 as originally filed ' 



2. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 

3. This opinion has been established as if (some of) the amendments had not been made, since they have been 
considered to go beyond the disclosure as filed (Rule 70.2(c)): 

4. Additional observations, if necessary: 



III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

The questions whether the daimed invention appears to be novel, to involve an inventive step (to be non-obvious), 
or to be industrially applicable have not been and will not be examined in respect of: 

□ the entire international application, 

H claims Nos. 1-3, 5, 13, 14, 18-28, 30-32 (completely), 4, 6-12, 15-17, 29 (partially), 
because: 

□ the said international application, or the said claims Nos. relate to the following subject matter which does 
not require an international preliminary examination (specify): 
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□ the description, claims or drawings {indicate particular elements be/ow) or said claims Nos. are so unclear 
that no meaningful opinion could be formed (specify): 

H the claims, or said claims Nos. 1-3, 5, 13, 14, 18-28. 30-32 (completely), 9-12, 15-17 (partially) are so 
inadequately supported by the description that no meaningful opinion could be formed. 

E9 no international search report has been established for the said claims Nos. 1-32 (partially). 

V. Reasoned statement under Rule 66.2(a)(tt) with regard to novelty, inventive step or industrial 

applicability; citations and explanations supporting such statement r _ 

1. Statement 

Novelty (N) Claims 9-12, 15-17 

Inventive step (IS) Claims 4, 6-12, 15-17, 29 

Industrial applicability (IA) Claims 

2. Citations and explanations 
see separate sheet 

Vli. Certain defects in the international application 

* 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 

see separate sheet 
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III. Non-establishment of opinion with regard to novelty, inventive step and 
industrial applicability 



1 . As a result of an objection of lack of unity raised by the International Search 
Authority, the international search report has been restricted to the invention 
referred to as invention 1 (claims 1-32, partially), namely: 

a polypeptide IMX4 represented by amino acid sequence ID No. 27 and DNA 
molecules represented by nucleic acid sequences ID No. 1 and ID No. 11,12 and 
13, antibodies binding to said polypeptide, host cells expressing said polypeptide, 
variants thereof, and therapeutic uses thereof. 

Consequently, an opinion with regard to novelty, inventive step and industrial 
applicability can only be partially formulated, and limited to the above mentioned^ 
invention 1. 

2. An opinion with regard to novelty, inventive step and industrial applicability cannot 
be formulated for the claims relating to a polypeptide comprising the amino acid 
sequence of SEQ ID No. 27, or a polypeptide encoded by a polynucleotide of SEQ 
ID No. 1, for the following reasons: 

According to the present application, the polypeptide having the amino acid 
sequence of SEQ ID No. 27 is the translational product of the nucleotide 
sequence of isolated DNA molecule IMX4 of SEQ ID No. 1 (see e.g. page 59, 
example 2). However, in view of the nucleotide sequence of SEQ ID No. 1, it 
appears that said DNA molecule cannot encode said polypeptide: 

A. The nucleotide sequence of SEQ ID No. 1 contains only one putative 
methionine initiation codon (ATG) at position 29. The corresponding putative 
open reading frame (ORF) stops after three amino acids (KAV) with a stop 
codon (TAG) at position 41 . This is too short to corresponds to an ORF 
leading to a translated product. Thus, there are at the nucleotide level no 
indications that the nucleotide sequence of SEQ ID No. 1 encodes for a 
polypeptide. 

B. The comparison of the amino acid sequence of SEQ ID No. 27 with the 
.nucleotide sequence of SEQ ID No. 1 shows that the codon supposed to 
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correspond to the methionine of SEQ ID No. 27 actually corresponds to a 
stop codon (TAG). Thus, the polypeptide of SEQ ID No. 27 cannot be 
translated from IMX4 as asserted in the present application (see e.g. page 

59, line 15). 

C. The alignment of the nucleotide sequence of SEQ ID No. 1 with the 

nucleotide sequence of the soares ovary tumour HbHOT Homo sapiens 
cDNA clone (D1) shows that the nucleotide sequence of SEQ ID No. 1 
corresponds to a 3" untranslated region. In fact, the nucleotide sequence of 
SEQ ID No. 1 shows over 96% identity with the 3" untranslated region of said 
human cDNA clone. This alignment shows that the last five A nucleotides of 
SEQ ID No. 1 are part of the polyA tail. A polyadenylation signal motif can ^ 
also be found at position 114. Moreover, because of the high mutation rate 
of non-coding sequences, 3' untranslated regions are gene specific. Thus, it 
can be totally excluded that a 3' untranslated region would show such a high 
identity with a coding region. 

Therefore, the nucleotide sequence of SEQ ID No. 1 , which is supposed to 
encode the polypeptide having the amino acid sequence of SEQ ID No. 27, 
actually corresponds to a 3' untranslated region. 

Since the polypeptide of SEQ ID No. 27 does not exist, and since the nucleotide 
sequence of SEQ ID No. 1 does not encode any polypeptide, an opinion with 
regard to novelty, inventive step and industrial applicability on the claims referring 
to said polypeptides, and related claims cannot be formulated (claims 1-3, 5, 13, 
14, 18-28 and 30-32, completely; 9-12 and 15-17, partially). 



V. Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive 
step or industrial applicability; citations and explanations supporting such 
statement 

1 . Reference is made to the following documents: 

D 1 : EMBL Database entry HS1 272990 
Accession Number AA480830 
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D2: EMBL Database entry HS6802 
Accession Number Z82215.1 

2. D1 describes the nucleotide sequence of a cDNA clone comprising a sequence 
showing over 96% identity with SEQ ID No. 1. 

D2 describes a nucleotide sequence showing over 97% identity with the 
nucleotide sequence of SEQ ID No. 1. 

Thus, in view of D1 or D2, the subject-matter of claims 9-12 and 15-17 is not 
novel. Therefore, claims 9-12 and 15-17 do not meet the requirements of Article 
33(2) PCT. 

3. The nucleotide sequence of SEQ ID No. 1 described in the present application ^_ 
corresponds to the 3' untranslated region of an unknown gene. The present 
application shows that the transcription of said gene is increased when cells are 
treated with interferon-y (example 1). However, this does not give any evidence on 
the specific function of the product of said gene. The function of the product of the 
gene corresponding to the 3* untranslated region of nucleotide sequence of SEQ 
ID No.1 being unknown, the isolation of said nucleotide does not solve any 
problem. Since there is no problem to be solved, there cannot be any inventive 
step involved. Thus, the subject-matter of claims 4, 6-12, 15-17 and 29 is not 
inventive. Therefore, claims 4, 6-12, 15-17 and 29 do not meet the requirements 
of Article 33(3) PCT. 

VII. Certain defects in the international application 

1 . Point (d) of claim 9 is lacking. 

2. The specification mentions that the sequences of SEQ ID No. 1 1 and 12 include 
the sequence of SEQ ID No. 1 (page 59, lines 7-10). It appears that this is not the 
case. 

3. The specification does not define the SEQ ID No. 18, 19 and 21-26 although they 
are mentioned in the claims. 
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4. Claim 16 refers to a method of making a recombinant host cell comprising the 
isolated nucleic acid molecule of claim 9. Claim 29 refers to a method of 
diagnosing irritable bowel disease or a susceptibility to irritable bowel disease in a 
subject. Both claims do not mention how said methods are to be performed. 

VIII. Certain observations on the international application 

1 . The formulation "...comprises..." in claims 4 and 8-12 does not clearly define the 
scope of the claim. Thus, this expression should be replaced with "consists of" 
(Article 6 PCT). 

2. Claims 9 and 12 lack clarity due to the formulation "hybridisable to". This 
formulation is vague and imprecise, it does not provide any technical information 
about the hybridisation conditions, and as a consequence is not adapted to clearly 
define the scope of the claim. In fact, any nucleic acid molecule may hybridise to 
the nucleic acid molecule of SEQ ID No. 1, when the hybridisation conditions are 
adapted to their degree of identity (Article 6 PCT). 

3. Claim 1 mentions that the DNA molecule of SEQ ID No. 1 has altered expression 
in a T84 model of gut barrier function. However, the specification does not provide 
any evidence for said altered expression. The present application only shows that 
treatment of the T84 cells with interferon-y induces expression of the DNA 
molecule of SEQ ID No. 1. Thus, claim 1 is not based on the description (Article 6- 
support PCT). 

4. Claim 29 refers to a method of diagnosing irritable bowel disease or a 
susceptibility to irritable bowel disease in a subject by determining the presence or 
absence of a polynucleotide. Apart from vague declarations, the present 
application does not provide any evidence showing that the irritable bowel disease 
may be evidenced by determining the presence or absence of the polynucleotide 
of SEQ ID No. 1. Thus, claim 29 is not based on the description (Article 6-support 
PCT). Moreover, since a polynucleotide is always either present or absent, the 
man skilled in the art, faced with the problem of establishing such a diagnosis 
would not be able to perform it with said method (Article 5 PCT). 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 



Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

see FURTHER INFORMATION sheet PCT/ ISA/210 



Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



It has to be noted that sequences ID no. 19, no. 22, no. 26, no. 32 and no. 36 do 
not exist in the originally filed sequence listing on which was based the 
search. There are in total 33 sequences and not 38. But the originally 
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A polypeptide IMX4 represented by amino acid sequence ID 
no. 27 and DNA molecules represented by nucleic acid 
sequences ID no.l and ID no. 11,12 and 13. Host cell. 
Antibody. Variants thereof. Therapeutic uses. 


2. 


Claims: 1-32 all partially 






A polypeptide IMX10 represented by amino acid sequence 
no. 28 and DNA molecules represented by nucleic acid 
sequences ID no. 2 and ID no. 14. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 


ID 


3. 


Claims: 1-32 all partially 






A polypeptide IMX21 represented by amino acid sequence 
no. 29 and DNA molecules represented by nucleic acid 
sequences ID no. 3 and ID no. 15. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 


ID 


4. 


Claims: 1-32 all partially 






A polypeptide IMX28 represented by amino acid sequence ID 
no. 30 and DNA molecules represented by nucleic acid 
sequences ID no. 4 and no. 16. Host cell. Antibody. Variants 
thereof. Therapeutic uses. 


5. 


Claims: 1-32 all partially 






A polypeptide IMX32 represented by amino acid sequence ID 
no. 31 and DNA molecules represented by nucleic acid 
sequences ID no. 5, no. 17 and no. 18. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 


6. 


Claims: 1-32 all partially 






A polypeptide IMX39 represented by amino acid sequence ID 
no. 33 and DNA molecules represented by nucleic acid 
sequences ID no. 20 and no. 6. Host cell. Antibody. Variants 
thereof. Therapeutic uses. 


7. 


Claims: 1-32 all partially 






A polypeptide IMX40 represented by amino acid sequence 
no. 34 and DNA molecules represented by nucleic acid 
sequences ID no. 7 and ID no. 21. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 
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A polypeptide IMX42 represented by amino acid sequence ID 
no. 35 and DNA molecules represented by nucleic acid 
sequences ID no. 8. Host cell. Antibody. Variants thereof. 
Therapeutic uses. 



9. Claims: 1-32 all partially 

A polypeptide IMX44 represented by amino acid sequence ID 
no. 37 and DNA molecules represented by nucleic acid 
sequences ID no. 9, no. 23 and no. 24. Host cell. Antibody. 
Variants thereof. Therapeutic uses. 



10. Claims: 1-32 all partially 

A polypeptide IMX56 represented by amino acid sequence ID 
no. 38 and DNA molecules represented by nucleic acid 
sequences ID no. 10 and no. 25. Host cell. Antibody. Variants 
thereof. Therapeutic uses. 
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NOVEL DNAs AND POLYPEPTIDES 



Reference to Related Application 

This application claims the benefit of U.S. Provisional Application S.N. 60/107,821 
filed November 10, 1998, which is incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The invention is directed to purified and isolated novel polypeptide molecules and 
fragments thereof, the nucleic acid molecules encoding such polypeptides, processes for 
production of recombinant forms of such polypeptide molecules, antibodies generated against 
such polypeptide molecules, fragmented peptides derived from these polypeptide molecules, 
and uses thereof. In particular, the invention is directed to the use of the nucleic acid and 
polypeptide molecules in the study of inflammatory bowel diseases (IBD) 

Description of Related Art 

Damage to the intestinal epithelial barrier is a hallmark of inflammatory bowel 
diseases (IBD). Examples of inflammatory bowel diseases include ileitis, Crohn's disease (CD), 
which can affect the whole digestive tract from mouth to anus, and ulcerative colitis (UC), 
which affects only the large intestine. Studying the factors that influence the integrity of the 
epithelial barrier in vivo is a difficult task for a number of reasons that include the complexity of 
the tissue itself (there are numerous cell types in the gut including epithelial, stromal, endocrine, 
neuronal and hematopoietic) and the technical problems associated with tissue manipulation in 
animals or in isolated organs. As a result of these issues, a number of in vitro models of 
epithelial barrier function have been developed over the years. The best characterized of these 
models is the T84 intestinal epithelial barrier system (Dharmsathaphom et al., Am. J. Physiol., 
246:G204-G208, 1984 and Madara et al.,./. Cell Biol.. 101:2124-2133. 1985). T84 cells, while 
derived from a human colonic adenocarcinoma, have retained many of the properties associated 
with normal colonic crypt cells. T84 cells form polarized monolayers that exhibit high electrical 
resistance and vectorial fluid and chloride secretion reminiscent of colonic crypts in vivo. These 
properties are directly dependent on a complex of proteins referred to as tight junctions and T84 
cells express many of the known members of this complex (Youakim et al.. Submitted for 
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publication, 1998). These cells also respond to proinflammatory cytokines, such as 
mterferon-gamma (INF-gamma). by decreasing barrier function (Youakim et al.. Submitted for 
publication, 1998; Madara et al., J. Clin. Invest., 83:724-727 1989; and Adams et al., J. 
Immunol, 150:2356-2363, 1993) and by up-regulating MHC Class II molecules and antigen 
presenting activity (Hershberg et al., J. Clin. Invest., 102:792-803, 1998). This model system is 
used to examine how the epithelial barrier is regulated by various agents such as 
interferon-gamma and cells of the immune system (e.g., neutrophils). The T84 model assists in 
the elucidation of the mechanism of barrier breakdown and recovery in response to these agents 
and in the identification of proteins (and genes) that may prevent barrier breakdown or stimulate 
barrier recovery. 

Given the important function of the epithelial gut barrier and despite the growing body of 
knowledge, there is a need in the art for the discovery, the identification, and the elucidation of 
the roles of new proteins involved in gut barrier function and IBD. 

The identification of the primary structure, or sequence, of an unknown protein is the 
culmination of an arduous process of experimentation. In order to identify an unknown protein, 
the investigator can rely upon a comparison of the unknown protein to known peptides using a 
variety of techniques known to those skilled in the art. For instance, proteins are routinely 
analyzed using techniques such as electrophoresis, sedimentation, chromatography, sequencing 
and mass spectrometry. 

In particular, comparison of an unknown protein to polypeptides of known molecular 
weight allows a determination of the apparent molecular weight of the unknown protein (T.D. 
Brock and M.T. Madigan, Biology of Microorganisms 76-77 (Prentice Hall, 6d ed. 1991)). 
Protein molecular weight standards are commercially available to assist in the estimation of 
molecular weights of unknown protein (New England Biolabs Inc. Catalog:130-131, 1995; J. L. 
Hartley, U.S. Patent No. 5,449,758). However, the molecular weight standards may not 
correspond closely enough in size to the unknown protein to allow an accurate estimation of 
apparent molecular weight. The difficulty in estimation of molecular weight is compounded in 
the case of proteins that are subjected to fragmentation by chemical or enzymatic means, 
modified by post-translational modification or processing, and/or associated with other proteins 
in non-covalent complexes. 

In addition, the unique nature of the composition of a protein with regard to its specific 
amino acid constituents results in unique positioning of cleavage sites within the protein. 
Specific fragmentation of a protein by chemical or enzymatic cleavage results in a unique 
"peptide fingerprint" (D. W. Cleveland et al.J. Biol. Chem. 252:1 102-1 106, 1977; M. Brown et 
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al. ? y. Gen. Virol. 50:309-316, 1980). Consequently, cleavage at specific sites results in 
reproducible fragmentation of a given protein into peptides of precise molecular weights. 
Furthermore, these peptides possess unique charge characteristics that determine the isoelectric 
pH of the peptide. These unique characteristics can be exploited using a variety of 
electrophoretic and other techniques (T.D. Brock and M.T. Madigan, Biology of 
Microorganisms 76-77 (Prentice Hall, 6d ed. 1991)). 

Fragmentation of proteins is further employed for amino acid composition analysis and 
protein sequencing (P. Matsudiara, J. Biol Chem. 262:10035-10038, 1987; C. Eckerskorn et 
aL, Electrophoresis 1 988, 9:830-838, 1988), particularly the production of fragments from 
proteins with a "blocked" N-terminus. In addition, fragmented proteins can be used for 
immunization, for affinity selection (R. A. Brown, U.S. Patent No. 5,151,412), for determination 
of modification sites (e.g. phosphorylation), for generation of active biological compounds (T.D. 
Brock and M.T. Madigan, Biology of Microorganisms 300-301 (Prentice Hall, 6d ed. 1991)), 
and for differentiation of homologous proteins (M. Brown et al., J. Gen. Virol. 50:309-316, 
1980). 

In addition, when a peptide fingerprint of an unknown protein is obtained, it can be 
compared to a database of known proteins to assist in the identification of the unknown protein 
using mass spectrometry (W.J. Henze! et al., Proc. Natl. Acad. Sci. USA 90:501 1-5015, 1993; D. 
Fenyo et al., Electrophoresis 19:998-1005, 1998). A variety of computer software programs to 
facilitate these comparisons are accessible via the Internet, such as Protein Prospector (Internet 
site: prospector.uscf.edu), Multildent (Internet site: www.expasy.ch/sprot/multiident.html), 
PeptideSearch (Internet site: www.mann.embl-heiedelberg.de.. .deSearch/FR PeptideSearch 
Form.html), and ProFound (Internet site: www.chait-sgi.rockefeller.edu/cgi-bin/prot-id- 
frag.html). These programs allow the user to specify the cleavage agent and the molecular 
weights of the fragmented peptides within a designated tolerance. The programs compare these 
molecular weights to protein molecular weight information stored in databases to assist in 
determining the identity of the unknown protein. Accurate information concerning the number 
of fragmented peptides and the precise molecular weight of those peptides is required for 
accurate identification. Therefore, increasing the accuracy in determining the number of 
fragmented peptides and their molecular weight should result in enhanced likelihood of success 
in the identification of unknown proteins. 

In addition, peptide digests of unknown proteins can be sequenced using tandem mass 
spectrometry (MS/MS), and the resulting sequence searched against databases (J.K. Eng, et al., 
J. Am. Soc. Mass Spec. 5:976-989 (1994); M. Mann and M. Wilm, Anal. Chem. 66:4390-4399 





WO 00/28033 PCT/US99/26788 

(1 994); J .A. Taylor and R.S. Johnson. Rapid Comm. Mass Spec. 11:1 067- 1 075 ( 1 997)). 
Searching programs that can be used in this process exist on the Internet, such as Lutefisk 97 
(Internet site: www.lsbc.com:70/Lutefisk97.html), and the Protein Prospector, Peptide Search 
and ProFound programs described above. Therefore, adding the sequence of a gene and its 
predicted protein sequence and peptide fragments to a sequence database can aid in the 
identification of unknown proteins using tandem mass spectrometry. 

Thus, there also exists a need in the art for polypeptides suitable for use in peptide 
fragmentation studies, preferably, polypeptides that have altered expression in irritable bowel 
diseases, for use in molecular weight measurements, and for use in protein sequencing using 
tandem mass spectrometry. 



SUMMARY OF THE INVENTION 

Using the T84 model, it was determined that certain polypeptides have altered (up- 
regulated or down-regulated) expression patterns in response to INF-gamma. Such molecules 
may have a role in gut barrier function and IBD and may be useful as potential therapeutic 
agents in the treatment of EBD and other gut pathologies. The invention aids in fulfilling these 
needs in the art by providing isolated nucleic acids and polypeptides encoded by these nucleic 
acids that have altered expression characteristics in the T84 gut barrier model. Particular 
embodiments of the invention are directed to isolated nucleic acid molecules comprising the 
DNA sequences of SEQ ID NOs:l-26 and isolated nucleic acid molecules encoding the amino 
acid sequences of SEQ ID NOs:27-38, as well as nucleic acid molecules complementary to these 
sequences. Both single-stranded and double-stranded RNA and DNA nucleic acid molecules are 
encompassed by the invention, as well as nucleic acid molecules that hybridize to a denatured, 
double-stranded DNA comprising all or a portion of SEQ ID NOs: 1-26 and/or the DNA that 
encodes the amino acid sequences of SEQ ID NOs:27-38. Also encompassed are isolated 
nucleic acid molecules that are derived by in vitro mutagenesis from nucleic acid molecules 
comprising sequences of SEQ ID NOs: 1-26, that are degenerate from nucleic acid molecules 
comprising sequences of SEQ ID NOs: 1-26, and that are allelic variants of DNA of the 
invention. The invention also encompasses recombinant vectors that direct the expression of 
these nucleic acid molecules and host cells transformed or transfected with these vectors. 

In another embodiment, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide having a nucleotide sequence at least 80%, preferable 85%, more 
preferably 90%, optimally 95%, identical to a sequence of a polynucleotide selected from the 
group consisting of: 
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(a) a polynucleotide fragment of SEQ ID NO: 1-26 or a polynucleotide which is 
hybridizable to SEQ ID NO: 1-26; 

(b) a polynucleotide encoding a polypeptide fragment of a translation of SEQ ID NO: 1 
26 or a polypeptide fragment encoded by the cDNA sequence which is hybridizable to SEQ ID 
NO:l-26; 

(c) a polynucleotide encoding a polypeptide epitope of a translation of SEQ ID NO: 1- 
26 or a polypeptide epitope encoded by a cDNA sequence which is hybridizable to SEQ ID 
NO: 1-26; 

(e) a polynucleotide encoding a polypeptide of a translation of SEQ ID NO: 1-26, 
having biological activity; 

(f) a polynucleotide which is a variant of SEQ ID NO: 1-26; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO: 1-26; 

(h) a polynucleotide which encodes a species homologue of a translation of SEQ ID 
NO: 1-26; 

(i) a polynucleotide capable of hybridizing under stringent conditions to any 

one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not hybridize 
under stringent conditions to a nucleic acid molecule having a nucleotide sequence of only A 
residues or of only T residues. 

In another embodiment, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide having a nucleotide sequence at least 80%, preferable 85%, more 
preferably 90%, optimally 95%, identical to a sequence of a polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide fragment of SEQ ID NO: 1 - 1 0 or a polynucleotide which is 
hybridizable to SEQ ID NO: 1-10; 

(b) a polynucleotide encoding a polypeptide fragment of a translation of SEQ ID NO: 1- 
10 or a polypeptide fragment encoded by the cDNA sequence which is hybridizable to SEQ ID 
NO:1-10; 

(c) a polynucleotide encoding a polypeptide epitope of a translation of SEQ ID NO: 1- 
10 or a polypeptide epitope encoded by a cDNA sequence which is hybridizable to SEQ ID 
NO:1-10; 

(e) a polynucleotide encoding a polypeptide of a translation of SEQ ID NO: I -10, 
having biological activity; 

(0 a polynucleotide which is a variant of SEQ ID NO: 1-10; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO: 1-10; 
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(h) a polynucleotide which encodes a species homologue of a translation of SEQ ED 
NO: 1-10; 

(i) a polynucleotide capable of hybridizing under stringent conditions to any 

one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not hybridize 
under stringent conditions to a nucleic acid molecule having a nucleotide sequence of only A 
residues or of only T residues. 

In another embodiment, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide having a nucleotide sequence at least 80%, preferable 85%, more 
preferably 90%, optimally 95%, identical to a sequence of a polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide fragment of SEQ ID NO: 1 1-26 or a polynucleotide which is 
hybridizable to SEQ ID NO: 1 1 -26; 

(b) a polynucleotide encoding a polypeptide fragment of a translation of SEQ ID NO: 

1 1-26 or a polypeptide fragment encoded by the cDNA sequence which is hybridizable to SEQ 
ID NO: 11-26; 

(c) a polynucleotide encoding a polypeptide epitope of a translation of SEQ ID NO: 1 1- 
26 or a polypeptide epitope encoded by a cDNA sequence which is hybridizable to SEQ ED 
NO: 11-26; 

(e) a polynucleotide encoding a polypeptide of a translation of SEQ ID NO: 1 1-26, 
having biological activity; 

(0 a polynucleotide which is a variant of SEQ ID NO: 1 1-26; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO: 1 1-26; 

(h) a polynucleotide which encodes a species homologue of a translation of SEQ ID 
NO: 11-26; 

(i) a polynucleotide capable of hybridizing under stringent conditions to any 

one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not hybridize 
under stringent conditions to a nucleic acid molecule having a nucleotide sequence of only A 
residues or of only T residues. 

In one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence 
encoding a secreted protein. In preferred embodiments, the isolated nucleic acid molecule 
comprises a nucleotide sequence encoding a polypeptide chosen from the group consisting of: 

(a) a polypeptide having the polypeptide sequence identified as a translation of 
SEQ ID NO: 1-26; 

(b) a polypeptide having the polypeptide sequence of SEQ ID NO: 27-38; and 
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(c) a polypeptide encoded by the cDNA which is hybridizable to SEQ ID NO: 1-26. 
Typically, the isolated nucleic acid molecule comprises the entire nucleotide sequence of SEQ 
ID NO: 1 -26 or a cDNA sequence which is hybridizable to SEQ ID NO: 1 -26. Typically the 
isolated nucleic acid molecule comprises sequential nucleotide deletions from portions of the 
nucleotide sequence encoding either the C-terminus or the N-terminus of the polypeptide. 

In other aspects, the invention provides a recombinant vector comprising the isolated 
nucleic acid molecule and a method of making a recombinant host cell comprising the isolated 
nucleic acid molecule and the recombinant host cell produced by such a method. 

In other preferred embodiments, the invention provides an isolated polypeptide having 
an amino acid sequence at least 80%, preferably at leasst 85%, more preferably at least 90% 
identical to the sequence of a polypeptide selected from the group consisting of: 

(a) a polypeptide fragment of a polypeptide encoded by a polynucleotide of SEQ ID 
NO: 1-26; 

(b) a polypeptide having the sequence of SEQ ID NO: 27-38; 

(c) a polypeptide domain of a polypeptide encoded by a polynucleotide of SEQ ID NO: 

1-26; 

(d) a polypeptide epitope of a polypeptide encoded by a polynucleotide of SEQ ID NO: 

1-26; 

(e) a secreted form of a polypeptide encoded by a polynucleotide of SEQ ID NO: 1-26; 

(f) a full length protein of a polypeptide encoded by a polynucleotide of SEQ ID NO: 1- 

26; 

(g) a variant of a polypeptide encoded by a polynucleotide of SEQ ID NO: 1-26; 

(h) an allelic variant of a polypeptide encoded by a polynucleotide of SEQ ID NO: 1-26; 

and 

(i) a species homologue of a polypeptide encoded by a polynucleotide of SEQ ID NO: 

1-26. 



In some embodiments, the full length polypeptide comprises sequential amino acid 
deletions from the C-terminus. In other embodiments, the mature polypeptide comprises 
sequential amino acid deletions from the C-terminus. Alternatively, the full length polypeptide 
can comprise sequential amino acid deletions from the N-terminus or the mature polypeptide ca 
comprise sequential amino acid deletions from the N-terminus. 
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In other aspects, the invention provides an isolated antibody that binds specifically to the 
isolated polypeptide, a recombinant host cell that expresses the isolated polypeptide, and a 
method of making an isolated polypeptide comprising culturing the recombinant host cell under 
conditions such that said polypeptide is expressed; and recovering said polypeptide. 

The invention in another embodiment is a method for preventing, treating, or 
ameliorating irritable bowel discorders, comprising administering to a mammalian subject a 
therapeutically effective amount of the isolated polypeptide or the isolated nucleic acid 
molecule. In another embodiment, the invention provides a method of diagnosing an irritable 
bowel disease or a susceptibility to irritable bowel disease in a subject comprising: determining 
the presence or absence of a mutation in the isolayed nucleic acid molecule, and diagnosing an 
irritable bowel disease or a susceptibility to irritable bowel disease based on the presence or 
absence of said mutation. Alternatively, irritable bowel disease or a susceptibility to irritable 
bowel disease in a subject can be diagnosed by a method comprising: determining the presence 
or amount of expression of the polypeptide in a biological sample; and diagnosing irritable 
bowel disease or a susceptibility to irritable bowel disease based on the presence or amount of 
expression of the polypeptide. 

Typically, a binding partner of the polypeptide is identified by contacting the 
polypeptide with a binding partner; and determining whether the binding partner affects a 
physical property or an activity of the polypeptide. Typically activity in a biological assay of a 
secreted polypeptide is identified by expressing the polynucleotide of SEQ ID NO: 1-26 in a 
cell; isolating the supernatant; detecting an activity in a biological assay; and identifying the 
polypeptide in the supernatant having the activity. 

In addition, the invention encompasses methods of using the nucleic acids noted above to 
identify nucleic acids encoding proteins homologous to SEQ ID NOs:27-38; to identify human 
chromosomes that contain the nucleotide sequences of the invention; to map genes near the 
nucleotide sequences of the invention on human chromosomes; and to identify genes associated 
with certain diseases, syndromes, or other human conditions associated with human 
chromosomes containing sequences of the invention. 

For example, four of the nucleotide sequences of the invention, IMX4 and EMX56, 
IMX21, and DVIX44 are located on chromosomes 22, 22, 7, and 19, respectively (IMX4 and 
IMX56 are both located on chromosome 22). Thus, the above-named nucleotide sequences 
(IMX4 ? IMX56, IMX21. and IMX44) can be used to identify human chromosome numbers 22, 

-8- 





WO 00/28033 PCT/US99/26788 
7, and 19: to map genes on human chromosome numbers 22, 7, and 19; and to identify genes 
associated with certain diseases, syndromes, or other human conditions associated with human 
chromosome numbers 22, 7, and 19. 

The invention also encompasses the use of sense or antisense oligonucleotides from the 
nucleic acids of SEQ ID NOs: 1 -26 to inhibit the expression of the polynucleotides encoded by 
the nucleotide sequences of the invention. 

The invention also encompasses isolated polypeptides and fragments thereof encoded by 
these nucleic acid molecules including soluble polypeptide portions of SEQ ID NOs:27-38. The 
invention further encompasses methods for the production of these polypeptides, including 
cultunng a host cell under conditions promoting expression and recovering the polypeptide from 
the culture medium if it is secreted or from cultured cells if it is not secreted. Especially, the 
expression of these polypeptides in bacteria, yeast, plant, insect, and animal cells is 
encompassed by the invention. 

In addition, the invention includes assays utilizing these polypeptides, to screen for 
potential inhibitors of activity associated with polypeptide counter-structure molecules, and 
methods of using these polypeptides as therapeutic agents for the treatment of diseases mediated 
by polypeptide counter-structure molecules. Further, methods of using these polypeptides in the 
design of inhibitors thereof are also an aspect of the invention. 

The invention further includes a method for using these polypeptides as molecular 
weight markers that allow the estimation of the molecular weight of a protein or a fragmented 
protein, as well as a method for the visualization of the molecular weight markers of the 
invention thereof using electrophoresis. The invention further encompasses methods for using 
the polypeptides of the invention as markers for determining the isoelectric point of an unknown 
protein, as well as controls for establishing the extent of fragmentation of a protein. 

Further encompassed by this invention are kits to aid in these determinations. 
Further encompassed by this invention is the use of the IMX nucleic acid sequences, 
predicted amino acid sequences of the polypeptide or fragments thereof, or a combination of the 
predicted amino acid sequences of the polypeptides and fragments thereof for use in searching 
an electronic database to aid in the identification of sample nucleic acids and/or proteins. 

The invention also encompasses IMX polypeptides and the use of these polypeptides as 
research reagents to further study gut epithelial barrier function and regulation and therapeutic 
reagents to treat inflammatory bowel disease and other gut pathologies. 
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Isolated polyclonal or monoclonal antibodies that bind to these polypeptides are also 
encompassed by the invention, in addition the use of these antibodies to aid in purifying IMX 
polypeptides. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 presents the nucleotide sequence of IMX 4 (SEQ ID NO: 1). 

Figure 2 presents the nucleotide sequence of IMX 10 (SEQ ID NO: 14). 

Figure 3 presents the nucleotide sequence of IMX 21 (SEQ ID NO: 15). 

Figure 4 presents the nucleotide sequence of IMX 28 (SEQ ID NO: 16). 

Figure 5 presents the nucleotide sequence of IMX 32 (SEQ ID NO: 17). 

Figure 6 presents the nucleotide sequence of IMX 39 (SEQ ID NO: 20). 

Figure 7 presents the nucleotide sequence of IMX 40 (SEQ ID NO: 7), 

Figure 8 presents the nucleotide sequence of IMX 42 (SEQ ID NO: 8). 

Figure 9 presents the nucleotide sequence of IMX 44 (SEQ ID NO: 23). 

Figure 10 presents the nucleotide sequence of IMX 56 (SEQ ID NO: 10). 

Figure 1 1 presents the amino acid sequence of IMX 4 (SEQ ID NO: 27). 

Figure 12 presents the amino acid sequence of IMX 10 (SEQ ID NO: 28). 

Figure 13 presents the amino acid sequence of IMX 21 (SEQ ID NO: 29). 

Figure 14 presents the amino acid sequence of IMX 28 (SEQ ID NO: 30). 

Figure 15 presents the amino acid sequence of IMX 32 (SEQ ID NO: 32). 

Figure 16 presents the amino acid sequence of IMX 39 (SEQ ID NO: 33). 

Figure 17 presents the amino acid sequence of IMX 40 (SEQ ID NO: 34). 

Figure 18 presents the amino acid sequence of IMX 42 (SEQ ID NO: 35). 

Figure 19 presents the amino acid sequence of IMX 44 (SEQ ID NO: 37). 

Figure 20 presents the amino acid sequence of IMX 56 (SEQ ID NO: 38). 

Figure 21 presents the nucleotide sequence of the 5' end of the clone (SEQ ID NO: 13) 
matched to part of the human ApoL gene (AF0 19225). 

Figure 22 presents comparison of the 5' end of the clone suggests it represents an 
alternative splice product to reported ApoL, L, bases 1-168 match to 2 exons on PAC carrying 
the ApoL gene but are not included in the reported complete cDNA. 
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DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

The following definitions are provided to facilitate understanding of certain terms used 
throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is altered "by the 
hand of man" from its natural state. For example, an isolated polynucleotide could be part of a 
vector or a composition of matter, or could be contained within a cell, and still be "isolated" 
because that vector, composition of matter, or particular cell is not the original environment of 
the polynucleotide. 

In the present invention, a "secreted" protein refers to those proteins capable of being 
directed to the ER, secretory vesicles, or the extracellular space as a result of a signal sequence, 
as well as those proteins released into the extracellular space without necessarily containing a 
signal sequence. If the secreted protein is released into the extracellular space, the secreted 
protein can undergo extracellular processing to produce a "mature" protein. Release into the 
extracellular space can occur by many mechanisms, including exocytosis and proteolytic 
cleavage. 

* 

As used herein, a "polynucleotide" refers to a molecule having a nucleic acid sequence 
contained in SEQ ID NO: 1-26. For example, the polynucleotide can contain all or part of the 
nucleotide sequence of the full length cDNA sequence, including the 5' and 3' untranslated 
sequences, the coding region, with or without the signal sequence, the secreted protein coding 
region, as well as fragments, epitopes, domains, and variants of the nucleic acid sequence. 
Moreover, as used herein, a "polypeptide" refers to a molecule having the translated amino acid 
sequence generated from the polynucleotide as broadly defined. 

A "polynucleotide" of the present invention also includes those polynucleotides capable 
of hybridizing, under stringent hybridization conditions, to sequences contained in SEQ ID 
NO: 1 -26, or the complement thereof, or the cDNA. "Stringent hybridization conditions" refers 
to an overnight incubation at 42° C in a solution comprising 50% formamide, 5x SSC (750 mM 
NaCl, 75 mM sodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% 
dextran sulfate, and 20 jig/ml denatured, sheared salmon sperm DNA, followed by washing the 
filters in 0. 1 x SSC at about 65°C. 

Also contemplated are nucleic acid molecules that hybridize to the polynucleotides of the 
present invention at lower stringency hybridization conditions. Changes in the stringency of 
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hybridization and signal detection are primarily accomplished through the manipulation of 
formamide concentration (lower percentages of formamide result in lowered stringency); salt 
conditions, or temperature. For example, lower stringency conditions include an overnight 
incubation at 37°C in a solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M NaH 2 P0 4 ; 
0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 100 ug/ml salmon sperm blocking DNA; 
followed by washes at 50°C with 1 XSSPE, 0. 1 % SDS. In addition, to achieve even lower 
stringency, washes performed following stringent hybridization can be done at higher salt 
concentrations (e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through the inclusion 
and/or substitution of alternate blocking reagents used to suppress background in hybridization 
experiments. Typical blocking reagents include Denhardt's reagent, BLOTTO, heparin, 
denatured salmon sperm DNA, and commercially available proprietary formulations. The 
inclusion of specific blocking reagents may require modification of the hybridization conditions 
described above, due to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as any 3' 
terminal polyA+ tract of a cDNA shown in the sequence listing), or to a complementary stretch 
of T (or U) residues, would not be included in the definition of "polynucleotide," since such a 
polynucleotide would hybridize to any nucleic acid molecule containing a poly (A) stretch or the 
complement thereof (e.g., practically any double-stranded cDNA clone). 

The polynucleotide of the present invention can be composed of any polyribonucleotide 
or polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. 
For example, polynucleotides can be composed of single- and double-stranded DNA, DNA that 
is a mixture of single- and double-stranded regions, single- and double-stranded RNA, and RNA 
that is mixture of single- and double-stranded regions, hybrid molecules comprising DNA and 
RNA that may be single-stranded or, more typically, double-stranded or a mixture of single- and 
double-stranded regions. In addition, the polynucleotide can be composed of triple-stranded 
regions comprising RNA or DNA or both RNA and DNA. A polynucleotide may also contain 
one or more modified bases or DNA or RNA backbones modified for stability or for other 
reasons. "Modified" bases include, for example, tritylated bases and unusual bases such as 
inosine. A variety of modifications can be made to DNA and RNA; thus, "polynucleotide" 
embraces chemically, enzymaticaily, or metabolically modified forms. 

The polypeptide of the present invention can be composed of amino acids joined to each 
other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. and may contain amino 
acids other than the 20 gene-encoded amino acids. The polypeptides may be modified by either 
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natural processes, such as posttranslational processing, or by chemical modification techniques 
which are well known in the art. Such modifications are well described in basic texts and in 
more detailed monographs, as well as in a voluminous research literature. Modifications can 
occur anywhere in a polypeptide, including the peptide backbone, the amino acid side-chains 
and the amino or carboxyl termini. It will be appreciated that the same type of modification may 
be present in the same or varying degrees at several sites in a given polypeptide. Also, a given 
polypeptide may contain many types of modifications. Polypeptides may be branched, for 
example, as a result of ubiquitination. and they may be cyclic, with or without branching. 
Cyclic, branched, and branched cyclic polypeptides may result from posttranslation natural 
processes or may be made by synthetic methods. Modifications include acetylation, acylation, 
ADP-ribosylation. amidation, covalent attachment of flavin, covalent attachment of a heme 
moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a 
lipid or lipid derivative, covalent attachment of phosphotidylinositol, cross-linking, cyclization, 
disulfide bond formation, demethylation, formation of covalent cross-links, formation of 
cysteine, formation of pyroglutamate, formulation, gamma-carboxylation, glycosylation, GPI 
anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation, 
proteolytic processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, 
transfer-RNA mediated addition of amino acids to proteins such as arginylation, and 
ubiquitination. (See, for instance, PROTEINS - STRUCTURE AND MOLECULAR 
PROPERTIES, 2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York (1993); 
POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, Ed., 
Academic Press, New York, pgs. 1-12 (1983); Seifter et al., Meth Enzymol 182:626-646 (1990); 
Rattan et al., Ann NY Acad Sci 663:48-62 (1992).) 

"A polypeptide having biological activity" refers to polypeptides exhibiting activity 
similar, but not necessarily identical to, an activity of a polypeptide of the present invention, 
including mature forms, as measured in a particular biological assay, with or without dose 
dependency. In the case where dose dependency does exist, it need not be identical to that of the 
polypeptide, but rather substantially similar to the dose-dependence in a given activity as 
compared to the polypeptide of the present invention (i.e., the candidate polypeptide will exhibit 
greater activity or not more than about 25-fold less and, preferably, not more than about tenfold 
less activity, and most preferably, not more than about three-fold less activity relative to the 
polypeptide of the present invention.). 

The translated amino acid sequence, beginning with the methionine, is identified 
although other reading frames can also be easily translated using known molecular biology 
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techniques. The polypeptides produced by the translation of these alternative open reading 
frames are specifically contemplated by the present invention. 

SEQ ID NO: 1-26 and the translations of SEQ ID NO: 1-26 as well as SEQ ID NO.27-38 
are sufficiently accurate and otherwise suitable for a variety of uses well known in the art and 
described further below. These probes will also hybridize to nucleic acid molecules in biological 
samples, thereby enabling a variety of forensic and diagnostic methods of the invention. 
Similarly, polypeptides identified from the translations of SEQ ID NO: 1-26 may be used to 
generate antibodies which bind specifically to the secreted proteins encoded by the cDNA clones 
identified. 

Nevertheless, DNA sequences generated by sequencing reactions can contain sequencing 
errors. The errors exist as misidentified nucleotides, or as insertions or deletions of nucleotides 
in the generated DNA sequence. The erroneously inserted or deleted nucleotides cause frame 
shifts in the reading frames of the predicted amino acid sequence. In these cases, the predicted 
amino acid sequence diverges from the actual amino acid sequence, even though the generated 
DNA sequence may be greater than 99.9% identical to the actual DNA sequence (for example, 
one base insertion or deletion in an open reading frame of over 1000 bases). 

The present invention also relates to the genes corresponding to SEQ ID NO: 1 -26, and 
translations of SEQ ID NO: 1-26. The corresponding gene can be isolated in accordance with 
known methods using the sequence information disclosed herein. Such methods include 
preparing probes or primers from the disclosed sequence and identifying or amplifying the 
corresponding gene from appropriate sources of genomic material. 

Also provided in the present invention are species homologues. Species homologues 
may be isolated and identified by making suitable probes or primers from the sequences 
provided herein and screening a suitable nucleic acid source for the desired homologue. 

The polypeptides of the invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a combination 
of these methods. Means for preparing such polypeptides are well understood in the art. 

The polypeptides may be in the form of the secreted protein, including the mature form, 
or may be a part of a larger protein, such as a fusion protein (see below). It is often 
advantageous to include an additional amino acid sequence which contains secretorv or leader 
sequences, pro-sequences, sequences which aid in purification, such as multiple histidine 
residues, or an additional sequence for stability during recombinant production. 

The polypeptides of the present invention are preferably provided in an isolated form, 
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and preferably are substantially purified. A recombinantly produced version of a polypeptide, 
including the secreted polypeptide, can be substantially purified by the one-step method 
described in Smith and Johnson, Gene 67:31-40 (1988). Polypeptides of the invention also can 
be purified from natural or recombinant sources using antibodies of the invention raised against 
the secreted protein in methods which are well known in the an. 

Signal Sequences 

Methods for predicting whether a protein has a signal sequence, as well as the cleavage 
point for that sequence, are available. For instance, the method of McGeoch, Virus Res. 3:271- 
286 (1985), uses the information from a short N-terminal charged region and a subsequent 
uncharged region of the complete (uncleaved) protein. The method of von Heinje, Nucleic 
Acids Res. 14:4683-4690 (1986) uses the information from the residues surrounding the 
cleavage site, typically residues -13 to +2, where +1 indicates the amino terminus of the secreted 
protein. Therefore, from a deduced amino acid sequence, a signal sequence and mature 
sequence can be identified. 

Polynucleotide and Polypeptide Variants 

"Variant" refers to a polynucleotide or polypeptide differing from the polynucleotide or 
polypeptide of the present invention, but retaining essential properties thereof. Generally, 
variants are overall closely similar, and, in many regions, identical to the polynucleotide or 
polypeptide of the present invention. 

Further embodiments of the present invention include polynucleotides having at least 
80% identity, more preferably at least 90% identity, and most preferably at least 95%, 96%, 
97%, 98% or 99% identity to a sequence contained in SEQ ID NO: 1-26. Of course, due to the 
degeneracy of the genetic code, one of ordinary skill in the art will immediately recognize that a 
large number of the polynucleotides having at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% 
identity will encode a polypeptide identical to an amino acid sequence contained in the 
translations of SEQ ID NO: 1 -26. 

Similarly, by a polypeptide having an amino acid sequence having at least, for example, 
95% "identity" to a reference polypeptide, is intended that the amino acid sequence of the 
polypeptide is identical to the reference polypeptide except that the polypeptide sequence may 
include up to five amino acid alterations per each 100 amino acids of the total length of the 
reference polypeptide. In other words, to obtain a polypeptide having an amino acid sequence at 
least 95% identical to a reference amino acid sequence, up to 5% of the amino acid residues in 
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the reference sequence may be deleted or substituted with another amino acid, or a number of 
amino acids up to 5% of the total amino acid residues in the reference sequence may be inserted 
into the reference sequence. These alterations of the reference sequence may occur at the amino 
or carboxy terminal positions of the reference amino acid sequence or anywhere between those 
terminal positions, interspersed either individually among residues in the reference sequence or 
in one or more contiguous groups within the reference sequence. 

Further embodiments of the present invention include polypeptides having at least 80% 
identity, more preferably at least 85% identity, more preferably at least 90% identity, and most 
preferably at least 95%, 96%, 97%, 98% or 99% identity to an amino acid sequence contained 
in translations of SEQ ID NO: 1-26. Preferably, the above polypeptides should exhibit at least 
one biological activity of the protein. 

In a preferred embodiment, polypeptides of the present invention include polypeptides 
having at least 90% similarity, more preferably at least 95% similarity, and still more preferably 
at least 96%, 97%, 98%, or 99% similarity to an amino acid sequence contained in translations 
of SEQ ID NO: 1-26 as well as the amino acid sequences of SEQ ID NO:27-38. 

The variants may contain alterations in the coding regions, non-coding regions, or both. 
Especially preferred are polynucleotide variants containing alterations which produce silent 
substitutions, additions, or deletions, but do not alter the properties or activities of the encoded 
polypeptide. Nucleotide variants produced by silent substitutions due to the degeneracy of the 
genetic code are preferred. Moreover, variants in which 5-10, 1-5, or 1-2 amino acids are 
substituted, deleted, or added in any combination are also preferred. Polynucleotide variants can 
be produced for a variety of reasons, e.g., to optimize codon expression for a particular host 
(change codons in the human mRNA to those preferred by a bacterial host such as E. coli). 

Naturally occurring variants are called "allelic variants," and refer to one of several 
alternate forms of a gene occupying a given locus on a chromosome of an organism. (Genes II, 
Lewin, B., ed., John Wiley & Sons, New York (1985).) These allelic variants can vary at either 
the polynucleotide and/or polypeptide level. Alternatively, non-naturally occurring variants may 
be produced by mutagenesis techniques or by direct synthesis. 

Using known methods of protein engineering and recombinant DNA technology, 
variants may be generated to improve or alter the characteristics of the polypeptides of the 
present invention. For instance, one or more amino acids can be deleted from the N-terminus or 
C-terminus of the secreted protein without substantial loss of biological function. The authors of 
Ron et al., J. Biol. Chem. 268: 2984-2988 (1993) reported variant KGF proteins having heparin 
binding activity even after deleting 3, 8. or 27 amino-terminal amino acid residues. Similarly 
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Interferon gamma exhibited up to ten times higher activity after deleting 8-10 amino acid 
residues from the carboxy terminus of this protein. (Dobeli et aL J. Biotechnology 7:199-216 
(1988).) 

Moreover, ample evidence demonstrates that variants often retain a biological activity 
similar to that of the naturally occurring protein. For example, Gayle and coworkers (J. Biol. 
Chem 268:22105-221 1 1 (1993)) conducted extensive mutational analysis of human cytokine IL- 
1 a. They used random mutagenesis to generate over 3,500 individual IL- 1 a mutants that 
averaged 2.5 amino acid changes per variant over the entire length of the molecule. Multiple 
mutations were examined at every possible amino acid position. The investigators found that 
M [m]ost of the molecule could be altered with little effect on either [binding or biological 
activity]." (See Gayle et al., (1993), Abstract.) In fact, only 23 unique amino acid sequences, out 
of more than 3,500 nucleotide sequences examined, produced a protein that significantly 
differed in activity from wild-type. 

Furthermore, even if deleting one or more amino acids from the N-terminus or C- 
tenrtinus of a polypeptide results in modification or loss of one or more biological functions, 
other biological activities may still be retained. For example, the ability of a deletion variant to 
induce and/or to bind antibodies which recognize the secreted form will likely be retained when 
less than the majority of the residues of the secreted form are removed from the N-terminus or 
C-terminus. Whether a particular polypeptide lacking N- or C-terminal residues of a protein 
retains such immunogenic activities can readily be determined by routine methods described 
herein and otherwise known in the art. 

Thus, the invention further includes polypeptide variants which show substantial 
biological activity. Such variants include deletions, insertions, inversions, repeats, and 
substitutions selected according to general rules known in the art so as have little effect on 
activity. For example, guidance concerning how to make phenotypically silent amino acid 
substitutions is provided in Bowie, J. U. et al., Science 247:1306-1310 (1990), wherein the 
authors indicate that there are two main strategies for studying the tolerance of an amino acid 
sequence to change. As the authors state, these two strategies, natural selection and genetic 
engineering, have revealed that proteins are surprisingly tolerant of amino acid substitutions. 
The authors further indicate which amino acid changes are likely to be permissive at certain 
ammo acid positions in the protein. For example, most buried (within the tertiary structure of 
the protein) amino acid residues require nonpolar side chains, whereas few features of surface 
side chains are generally conserved. Moreover, tolerated conservative amino acid substitutions 
involve replacement of the aliphatic or hydrophobic amino acids Ala, Val, Leu and He; 
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replacement of the hydroxyl residues Ser and Thr: replacement of the acidic residues Asp and 
Glu; replacement of the amide residues Asn and Gin. replacement of the basic residues Lvs Ar» 
and His; replacement of the aromatic residues Phe. Tyr, and Trp, and replacement of the small- 
sized amino acids Ala. Ser, Thr. Met. and Glv. 

Besides conservative amino acid substitution, variants of the present invention include (i) 
substitutions with one or more of the non-conserved amino acid residues, where the substituted 
amino acid residues may or may not be one encoded by the genetic code, or (ii) substitution with 
one or more of amino acid residues having a substituent group, or (iii) fusion of the mature 
polypeptide with another compound, such as a compound to increase the stability and/or 
solubility of the polypeptide (for example, polyethylene glycol), or (iv) fusion of the polypeptide 
with additional amino acids, such as an IgG Fc fusion region peptide, or leader or secretory 
sequence, or a sequence facilitating purification. Such variant polypeptides are deemed to be 
within the scope of those skilled in the art from the teachings herein. 

For example, polypeptide variants containing amino acid substitutions of charged amino 
acids with other charged or neutral amino acids may produce proteins with improved 
characteristics, such as less aggregation. Aggregation of pharmaceutical formulations both 
reduces activity and increases clearance due to the aggregate's immunogenic activity. (Pinckard 
et al., Clin. Exp. Immunol. 2:331-340 (1967); Robbins et al.. Diabetes 36: 838-845 (1987); 
Cleland et al., Crit. Rev. Therapeutic Drug Carrier Systems 10:307-377 (1993).) 

Polynucleotide and Polypeptide Fragments 

In the present invention, a "polynucleotide fragment" refers to a short polynucleotide 
having a nucleic acid sequence contained in that shown in SEQ ID NO: 1-26. The short 
nucleotide fragments are preferably at least about 15 nt, and more preferably at least about 20 nt, 
still more preferably at least about 30 nt, and even more preferably, at least about 40 nt in length. 
A fragment "at least 20 nt in length," for example, is intended to include 20 or more contiguous 
bases from the cDNA sequence contained in that shown in SEQ ID NO: 1-26. These nucleotide 
fragments are useful as diagnostic probes and primers as discussed herein. Of course, larger 
fragments (e.g., 50, 150, and more nucleotides) are preferred. 

Moreover, representative examples of polynucleotide fragments of the invention, 
include, for example, fragments having a sequence from about nucleotide number 1-50, 51-260, 
101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401-450. and so forth, to the end of 
SEQ ID NO: 1-26. In this context "about" includes the particularly recited ranges, larger or 
smaller by several (5. 4, 3, 2. or 1 ) nucleotides, at either terminus or at both termini. Preferably, 
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these fragments encode a polypeptide which has biological activity. 

In the present invention, a "polypeptide fragment" refers to a short amino acid sequence 
contained in the translations of SEQ ID NO: 1-26 as well as SEQ ID NO:27-38. Protein 
fragments may be "free-standing," or comprised within a larger polypeptide of which the 
fragment forms a pan or region, most preferably as a single continuous region. Representative 
examples of polypeptide fragments of the invention, include, for example, fragments from about 
amino acid number 1-20, 21-40 ? 41-60, and so forth, to the end of the coding region. Moreover, 
polypeptide fragments can be about 20, 30, 40, 50 or 60, amino acids in length. In this context 
"about" includes the particularly recited ranges, larger or smaller by several (5, 4, 3, 2, or 1) 
amino acids, at either extreme or at both extremes. 

Preferred polypeptide fragments include the secreted protein as well as the mature form. 
Further preferred polypeptide fragments include the secreted protein or the mature form having a 
continuous series of deleted residues from the amino or the carboxy terminus, or both. For 
example, any number of amino acids, ranging from 1-60, can be deleted from the amino 
terminus of either the secreted polypeptide or the mature form. Similarly, any number of amino 
acids, ranging from 1-30, can be deleted from the carboxy terminus of the secreted protein or 
mature form. Furthermore, any combination of the above amino and carboxy terminus deletions 
are preferred. Similarly, polynucleotide fragments encoding these polypeptide fragments are 
also preferred. 

Also preferred are polypeptide and polynucleotide fragments characterized by structural 
or functional domains, such as fragments that comprise alpha-helix and alpha-helix forming 
regions, beta-sheet and beta-sheet-forming regions, turn and turn-forming regions, coil and coil- 
forming regions, hydrophilic regions, hydrophobic regions, alpha amphipathic regions, beta 
amphipathic regions, flexible regions, surface-forming regions, substrate binding region, and 
high antigenic index regions. Polypeptide fragments of the translations of SEQ ID NO: 1-26 as 
well as SEQ ID NO:27-38 falling within conserved domains are specifically contemplated by 
the present invention. Moreover, polynucleotide fragments encoding these domains are also 
contemplated. 

Other preferred fragments are biologically active fragments. Biologically active 
fragments are those exhibiting activity similar, but not necessarily identical, to an activity of the 
polypeptide of the present invention. The biological activity of the fragments may include an 
improved desired activity, or a decreased undesirable activity. 



Epitopes & Antibodies 
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In the present invention, "epitopes" refer to polypeptide fragments having antigenic or 
immunogenic activity in an animal, especially in a human. A preferred embodiment of the 
present invention relates to a polypeptide fragment comprising an epitope, as well as the 
polynucleotide encoding this fragment. A region of a protein molecule to which an antibody can 
bind is defined as an "antigenic epitope." In contrast, an "immunogenic epitope" is defined as a 
part of a protein that elicits an antibody response. (See, for instance, Geysen et al., Proc. Natl. 
Acad. Sci. USA 81 :3998-4002 (1983).) 

Fragments which function as epitopes may be produced by any conventional means. 
(See, e.g., Houghten, R. A., Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985) further 
described in U.S. Patent No. 4,631,211.) 

In the present invention, antigenic epitopes preferably contain a sequence of at least 
seven, more preferably at least nine, and most preferably between about 15 to about 30 amino 
acids. Antigenic epitopes are useful to raise antibodies, including monoclonal antibodies, that 
specifically bind the epitope. (See, for instance, Wilson et al., Cell 37:767-778 (1984); Sutcliffe, 
J. G. et al., Science 219:660-666 (1983).) 

Similarly, immunogenic epitopes can be used to induce antibodies according to methods 
well known in the art. (See, for instance, Sutcliffe et ah, supra; Wilson et al., supra; Chow, M. et 
al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et al., J. Gen. Virol. 66:2347- 
2354 (1985).) A preferred immunogenic epitope includes the secreted protein. The 
immunogenic epitopes may be presented together with a carrier protein, such as an albumin, to 
an animal system (such as rabbit or mouse) or, if it is long enough (at least about 25 amino 
acids), without a carrier. However, immunogenic epitopes comprising as few as 8 to 10 amino 
acids have been shown to be sufficient to raise antibodies capable of binding to, at the very least, 
linear epitopes in a denatured polypeptide (e.g., in Western blotting.) 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant to 
include intact molecules as well as antibody fragments (such as, for example. Fab and F(ab')2 
fragments) which are capable of specifically binding to protein. Fab and F(ab , )2 fragments lack 
the Fc fragment of intact antibody, clear more rapidly from the circulation, and may have less 
non-specific tissue binding than an intact antibody. (Wahl et al., J. Nucl. Med. 24:3 16-325 
(1983).) Thus, these fragments are preferred, as well as the products of a FAB or other 
immunoglobulin expression library. Moreover, antibodies of the present invention include 
chimeric, single chain, and humanized antibodies. 



Fusion Proteins 
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Any polypeptide of the present invention can be used to generate fusion proteins. For 
example, the polypeptide of the present invention, when fused to a second protein, can be used 
as an antigenic tag. Antibodies raised against the polypeptide of the present invention can be 
used to indirectly detect the second protein by binding to the polypeptide. Moreover, because 
secreted proteins target cellular locations based on trafficking signals, the polypeptides of the 
present invention can be used as targeting molecules once fused to other proteins. 

Examples of domains that can be fused to polypeptides of the present invention include 
not only heterologous signal sequences, but also other heterologous functional regions. The 
fusion does not necessarily need to be direct, but may occur through linker sequences. 

Moreover, fusion proteins may also be engineered to improve characteristics of the 
polypeptide of the present invention. For instance, a region of additional amino acids, 
particularly charged amino acids, may be added to the N-terminus of the polypeptide to improve 
stability and persistence during purification from the host cell or subsequent handling and 
storage. Also, peptide moieties may be added to the polypeptide to facilitate purification. Such 
regions may be removed prior to final preparation of the polypeptide. The addition of peptide 
moieties to facilitate handling of polypeptides are familiar and routine techniques in the art. 

Moreover, polypeptides of the present invention, including fragments, and specifically 
epitopes, can be combined with parts of the constant domain of immunoglobulins (IgG), 
resulting in chimeric polypeptides. These fusion proteins facilitate purification and show an 
increased half-life in vivo. One reported example describes chimeric proteins consisting of the 
first two domains of the human CD4-polypeptide and various domains of the constant regions of 
the heavy or light chains of mammalian immunoglobulins. (EP A 394,827; Traunecker et al., 
Nature 331:84-86 (1988).) Fusion proteins having disulfide-linked dimeric structures (due to the 
IgG) can also be more efficient in binding and neutralizing other molecules, than the monomelic 
secreted protein or protein fragment alone. (Fountoulakis et al., J. Biochem. 270:3958-3964 
(1995).) 

Similarly, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion proteins 
comprising various portions of constant region of immunoglobulin molecules together with 
another human protein or part thereof. In many cases, the Fc part in a fusion protein is 
beneficial in therapy and diagnosis, and thus can result in, for example, improved 
pharmacokinetic properties. (EP-A 0 232 262.) Alternatively, deleting the Fc part after the 
fusion protein has been expressed, detected, and purified, would be desired. For example, the Fc 
portion may hinder therapy and diagnosis if the fusion protein is used as an antigen for 
immunizations. In drug discovery, for example, human proteins, such as hIL-5, have been fused 
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with Fc portions for the purpose of high-throughput screening assays to identify antagonists of 
hIL-5. (See, D. Bennett et ah. J. Molecular Recognition 8:52-58 (1995); K. Johanson et al., J. 
Biol. Chem. 270:9459-9471 (1995).) 

Moreover, the polypeptides of the present invention can be fused to marker sequences, 
5 such as a peptide which facilitates purification of the fused polypeptide. In preferred 

embodiments, the marker amino acid sequence is a hexa-histidine peptide, such as the tag 
provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 9131 1), among 
others, many of which are commercially available. As described in Gentz et aL, Proc. Natl. 
Acad. Sci. USA 86:821-824 (1989), for instance, hexa-histidine provides for convenient 
10 purification of the fusion protein. Another peptide tag useful for purification, the "HA" tag, 

corresponds to an epitope derived from the influenza hemagglutinin protein. (Wilson et al., Cell 
37:767(1984).) 

Thus, any of these above fusions can be engineered using the polynucleotides 
or the polypeptides of the present invention. 

15 

Vectors. Host Cells, and Protein Production 

The present invention also relates to vectors containing the polynucleotide of the present 
invention, host cells, and the production of polypeptides by recombinant techniques. The vector 
may be, for example, a phage, plasmid, viral, or retroviral vector. Retroviral vectors may be 
20 replication competent or replication defective. In the latter case, viral propagation generally will 
occur only in complementing host cells. 



25 SEQIDNO: 1-26. 

The amino acid sequences of the polypeptides encoded by the nucleotide sequences of 

the invention are given in SEQ ID NO:27-38. 

The discovery of the nucleic acids of the invention enables the construction of expression 

vectors comprising nucleic acid sequences encoding polypeptides; host cells transfected or 

30 transformed with the expression vectors; isolated and purified biologically active polypeptides 

and fragments thereof; the use of the nucleic acids or oligonucleotides thereof as probes to 

identify nucleic acids encoding proteins.having amino acid sequences homologous to SEQ ID 

NOs: 27-38; the use of the nucleic acids or oligonucleotides thereof to identify human 

chromosomes, for example, 22, 7, and 19; the use of the nucleic acids or oligonucleotides 



Nucleic Acid Molecules and Polypeptides of the Present Invention 

The nucleic acid molecules encompassed in the invention comprise nucleotide sequences 
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thereof to map genes on human chromosomes, for example, 22. 7, and 19; the use of the nucleic 
acids or oligonucleotides thereof to identify genes associated with certain diseases, syndromes or 
other human conditions associated with human chromosomes such as 22, 7, and 19; the use of 
single-stranded sense or antisense oligonucleotides from the nucleic acids to inhibit expression 
of polynucleotides encoded by the IMX sequences; the use of such polypeptides and soluble 
fragments as molecular weight markers; the use of such polypeptides and fragmented peptides as 
controls for peptide fragmentation, and kits comprising these reagents; the use of such 
polypeptides and fragments thereof to generate antibodies, and the use of antibodies to purify 
IMX polypeptides. 

NUCLEIC ACID MOLECULES 

In a particular embodiment, the invention relates to certain isolated nucleotide sequences 
that are free from contaminating endogenous material. A "nucleotide sequence" refers to a 
polynucleotide molecule in the form of a separate fragment or as a component of a larger nucleic 
acid construct. The nucleic acid molecule has been derived from DNA or RNA isolated at least 
once in substantially pure form and in a quantity or concentration enabling identification, 
manipulation, and recovery of its component nucleotide sequences by standard biochemical 
methods (such as those outlined in Sambrook et ah, Molecular Cloning: A Laboratory Manual, 
2nd sed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989)). Such sequences are 
preferably provided and/or constructed in the form of an open reading frame uninterrupted by 
internal non-translated sequences, or introns, that are typically present in eukaryotic genes. 
Sequences of non-translated DNA can be present 5* or 3' from an open reading frame, where the 
same do not interfere with manipulation or expression of the coding region. 

Nucleic acid molecules of the invention include DNA in both single-stranded and 
double-stranded form, as well as the RNA complement thereof. DNA includes, for example, 
cDNA, genomic DNA, chemically synthesized DNA, DNA amplified by PCR, and 
combinations thereof Genomic DNA may be isolated by conventional techniques, e.g., using 
the cDNA of SEQ ID NOs: 1-26, or a suitable fragment thereof, as a probe. 

The DNA molecules of the invention include full length genes as well as polynucleotides 
and fragments thereof The full length gene may include the N-terminal signal peptide. Other 
embodiments include DNA encoding a soluble form, e.g., encoding the extracellular domain of 
the protein, either with or without the signal peptide. 

The nucleic acids of the invention are preferentially derived from human sources, but the 
invention includes those derived from non-human species, as well. 
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Preferred Sequences 

The particularly preferred nucleotide sequences of the invention are SEQ ID NOs: 1-26, 
as set forth above. cDNA clones having the nucleotide sequence of SEQ ID NOs: 1-26 were 
5 isolated as described in Example 1. The sequences of amino acids encoded by the DNA of SEQ 
ID NOs: 1-26 are shown in SEQ ID NOs:27-38. 

Additional Sequences 

Due to the known degeneracy of the genetic code, wherein more than one codon can 
10 encode the same amino acid, a DNA sequence can vary from that shown in SEQ ID NOs: 1-26, 
and still encode a polypeptide having the amino acid sequence of SEQ ID NOs:27-38. Such 
variant DNA sequences can result from silent mutations (e.g., occurring during PCR 
amplification), or can be the product of deliberate mutagenesis of a native sequence. 

The invention thus provides additional isolated DNA sequences encoding polypeptides 
15 of the invention, selected from: (a) DNA comprising the nucleotide sequences of SEQ ID NOs: 
1-26; (b) DNA encoding the polypeptides of SEQ ID NOs:27-38; (c) DNA capable of 
hybridization to a DNA of (a) or (b) under conditions of moderate stringency and which encodes 
polypeptides of the invention; (d) DNA capable of hybridization to a DNA of (a) or (b) under 
conditions of high stringency and which encodes polypeptides of the invention, and (e) DNA 
20 which is degenerate, as a result of the genetic code, to a DNA defined in (a), (b), (c), or (d) and 
which encode polypeptides of the invention. Of course, polypeptides encoded by such DNA 
sequences are encompassed by the invention. 

As used herein, conditions of moderate stringency can be readily determined by those 
having ordinary skill in the art based on, for example, the length of the DNA. The basic 
25 conditions are set forth by Sambrook et al. Molecular Cloning: A Laboratory Manual, 2 ed. 
Vol. 1, pp. 1.101-264, Cold Spring Harbor Laboratory Press, (1989), and include use of a 
prewashing solution for the nitrocellulose filters 5X SSC, 0.5% SDS, 1 .0 mM EDTA (pH 8.0), 
hybridization conditions of about 50% formamide, 6X SSC at about 42°C (or other similar 
hybridization solution, such as Stark's solution, in about 50% formamide at about 42°C), and 
30 washing conditions of about 60°C, 0.5X SSC, 0.1% SDS. Conditions of high stringency can 
also be readily determined by the skilled artisan based on. for example, the length of the DNA. 
Generally, such conditions are defined as hybridization conditions as above, and with washing at 
approximately 68°C, 0.2X SSC, 0. 1% SDS. The skilled artisan will recognize that the 
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temperature and wash solution salt concentration can be adjusted as necessary according to 
factors such as the length of the probe. 

Also included as an embodiment of the invention is DNA encoding polypeptide 
fragments and polypeptides comprising inactivated N-glycosylation site(s), inactivated protease 
processing site(s), or conservative amino acid substitution(s), as described below. 

In another embodiment, the nucleic acid molecules of the invention also comprise 
nucleotide sequences that are at least 80% identical to a native sequence. Also contemplated are 
embodiments in which a nucleic acid molecule comprises a sequence that is at least 90% 
identical, at least 95% identical, at least 98% identical, at least 99% identical, or at least 99.9% 
identical to a native sequence. 

The percent identity may be determined by visual inspection and mathematical 
calculation. Alternatively, the percent identity of two nucleic acid sequences can be determined 
by comparing sequence information using the GAP computer program, version 6.0 described by 
Devereux et al. (NucL Acids Res. 12:387, 1984) and available from the University of Wisconsin 
Genetics Computer Group (UWGCG). The preferred default parameters for the GAP program 
include: (1) a unary comparison matrix (containing a value of 1 for identities and 0 for non- 
identities) for nucleotides, and the weighted comparison matrix of Gribskov and Burgess, NucL 
Acids Res. 74:6745, 1986, as described by Schwartz and Dayhoff, eds., Atlas of Protein 
Sequence and Structure, National Biomedical Research Foundation, pp. 353-358, 1979; (2) a 
penalty of 3.0 for each gap and an additional 0.10 penalty for each symbol in each gap; and (3) 
no penalty for end gaps. Other programs used by one skilled in the art of sequence comparison 
may also be used. 

The invention provides isolated nucleic acids useful in the production of polypeptides. 
Such polypeptides may be prepared by any of a number of conventional techniques. A DNA 
sequence encoding an IMX polypeptide or a desired fragment thereof may be subcloned into an 
expression vector for production of the polypeptide or fragment. The DNA sequence 
advantageously is fused to a sequence encoding a suitable leader or signal peptide. 
Alternatively, the desired fragment may be chemically synthesized using known techniques. 
DNA fragments also may be produced by restriction endonuclease digestion of a full length 
cloned DNA sequence, and isolated by electrophoresis on agarose gels. If necessary, 
oligonucleotides that reconstruct the 5' or 3* terminus to a desired point may be ligated to a DNA 
fragment generated by restriction enzyme digestion. Such oligonucleotides may additionally 
contain a restriction endonuclease cleavage site upstream of the desired coding sequence, and 
position an initiation codon (ATG) at the N-terminus of the coding sequence. 
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The well-known polymerase chain reaction (PCR) procedure also may be employed to 
isolate and amplify a DNA sequence encoding a desired protein fragment. Oligonucleotides that 
define the desired termini of the DNA fragment are employed as 5' and 3' primers. The 
oligonucleotides may additionally contain recognition sites for restriction endonucleases, to 
facilitate insertion of the amplified DNA fragment into an expression vector. PCR techniques 
are described in Saiki et al., Science 239:487 (1988); Recombinant DNA Methodology, Wu et al., 
eds., Academic Press, Inc., San Diego (1989), pp. 189-196; and PCR Protocols: A Guide to 
Methods and Applications, Innis et al., eds., Academic Press, Inc. (1990). 

POLYPEPTIDES AND FRAGMENTS THEREOF 

The invention encompasses polypeptides and fragments thereof in various forms, 
including those that are naturally occurring or produced through various techniques such as 
procedures involving recombinant DNA technology. Such forms include, but are not limited to, 
derivatives, variants, and oligomers, as well as fusion proteins or fragments thereof. 

Polypeptides and Fragments Thereof 

The polypeptides of the invention include full length proteins encoded by the nucleic 
acid sequences set forth above. Particularly preferred polypeptides comprise the amino acid 
sequences of SEQ ID NOs:27-38. 

The polypeptides of the invention may include an N-terminal hydrophobic region that 
functions as a signal peptide, and may contain an extracellular domain, and may also contain a 
transmembrane region and a C-terminal cytoplasmic domain as well as a spacer region. 
Computer analysis may be used to predict the location of the signal peptide. 

For example, the isolated polypeptides of SEQ ID NO: 14 (IMX 28) and SEQ ID NO: 19 
(IMX 44) include an N-terminal hydrophobic region that functions as a signal peptide. 
Computer analysis predicts that the signal peptide corresponds to residues 1 to 1 7 of SEQ ID 
NO: 14, and cleavage of the signal peptide of SEQ ID NO: 14 results in a mature protein 
comprising amino acids 18 to 372. The signal peptide of IMX 44 corresponds to residues 1 to 
37 of SEQ ID NO: 1 9. The next most likely computer-predicted signal peptide cleavage sites (in 
descending order) would occur after amino acids 36, 26 and 27 of SEQ ID NO: 19. Cleavage of 
the signal peptide at position 37 thus would yield a mature protein comprising amino acids 38 
through 26 1 of SEQ ID NO: 1 9. 

The skilled artisan will recognize that the above-described boundaries of such regions of 
the polypeptide are approximate. To illustrate, the boundaries of the mature protein (which may 
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be predicted by using computer programs available for that purpose) may differ from those 
described above. 

The polypeptides of the invention may be membrane bound or they may be secreted and, 
thus, soluble. Soluble polypeptides are capable of being secreted from the cells in which they 
are expressed. In general, soluble polypeptides may be identified (and distinguished from non- 
soluble membrane-bound counterparts) by separating intact cells which express the desired 
polypeptide from the culture medium, e.g., by centrifugation, and assaying the medium 
(supernatant) for the presence of the desired polypeptide. The presence of polypeptide in the 
medium indicates that the polypeptide was secreted from the cells and thus is a soluble form of 
the protein. 

In one embodiment, the soluble polypeptides and fragments thereof comprise all or part 
of the extracellular domain, but lack the transmembrane region that would cause retention of the 
polypeptide on a cell membrane. A soluble polypeptide may include the cytoplasmic domain, or 
a portion thereof, as long as the polypeptide is secreted from the cell in which it is produced. 

In general, the use of soluble forms is advantageous for certain applications. Purification 
of the polypeptides from recombinant host cells is facilitated, since the soluble polypeptides are 
secreted from the cells. Further, soluble polypeptides are generally more suitable for 
intravenous administration. 

The invention also provides polypeptides and fragments of the extracellular domain that 
retain a desired biological activity. Particular embodiments are directed to polypeptide 
fragments that retain the ability to bind the native cognate, substrate, or counter-structure 
("binding partner"). Such a fragment may be a soluble polypeptide, as described above. In 
another embodiment, the polypeptides and fragments advantageously include regions that are 
conserved in the family as described above. 

Also provided herein are polypeptide fragments comprising at least 20, or at least 30, 
contiguous amino acids of the sequence of SEQ ID NOs:27-38. Fragments derived from the 
cytoplasmic domain find use in studies of signal transduction, and in regulating cellular 
processes associated with transduction of biological signals. Polypeptide fragments also may be 
employed as immunogens, in generating antibodies. 



Variants 



Naturally occurring variants as well as derived variants of the polypeptides and 
fragments are provided herein. 
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Variants may exhibit ammo acid sequences that are at least 80% identical. Also 
contemplated are embodiments in which a polypeptide or fragment comprises an amino acid 
sequence that is at least 90% identical, at least 95% identical at least 98% identical, at least 99% 
identical, or at least 99.9% identical to the preferred polypeptide or fragment thereof. Percent 
identity may be determined by visual inspection and mathematical calculation. Alternatively, 
the percent identity of two protein sequences can be determined by comparing sequence 
information using the GAP computer program, based on the algorithm of Needleman and 
Wunsch (J. Mol. Bio. 48:443, 1970) and available from the University of Wisconsin Genetics 
Computer Group (UWGCG). The preferred default parameters for the GAP program include: 
(1) a scoring matrix, blosum62, as described by Henikoff and Henikoff (Proc. Natl. Acad. Sci. 
USA 89:10915, 1992); (2) a gap weight of 12; (3) a gap length weight of 4; and (4) no penalty 
for end gaps. Other programs used by one skilled in the art of sequence comparison may also be 
used. 

The variants of the invention include, for example, those that result from alternate 
mRNA splicing events or from proteolytic cleavage. Alternate splicing of mRNA may, for 
example, yield a truncated but biologically active protein, such as a naturally occurring soluble 
form of the protein. Variations attributable to proteolysis include, for example, differences in 
the N- or C-tenmini upon expression in different types of host cells, due to proteolytic removal 
of one or more terminal amino acids from the protein (generally from 1-5 terminal amino acids). 
Proteins in which differences in amino acid sequence are attributable to genetic polymorphism 
(allelic variation among individuals producing the protein) are also contemplated herein. 

Additional variants within the scope of the invention include polypeptides that may be 
modified to create derivatives thereof by forming covalent or aggregative conjugates with other 
chemical moieties, such as glycosyl groups, lipids, phosphate, acetyl groups and the like. 
Covalent derivatives may be prepared by linking the chemical moieties to functional groups on 
amino acid side chains or at the N-terminus or C-terminus of a polypeptide. Conjugates 
comprising diagnostic (detectable) or therapeutic agents attached thereto are contemplated 
herein, as discussed in more detail below. 

Other derivatives include covalent or aggregative conjugates of the polypeptides with 
other proteins or polypeptides, such as by synthesis in recombinant culture as N-terminal or C- 
terminal fusions. Examples of fusion proteins are discussed below in connection with 
oligomers. Further, fusion proteins can comprise peptides added to facilitate purification and 
identification. Such peptides include, for example, poly-His or the antigenic identification 
peptides described in U.S. Patent No. 5,01 1,912 and in Hopp et aL Bio/Technology 6:1204, 

-28- 





WO 00/28033 PCT/US99/26788 
1988. One such peptide is the FLAG* peptide. Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys, which is 
highly antigenic and provides an epitope reversibly bound by a specific monoclonal antibody, 
enabling rapid assay and facile purification of expressed recombinant protein. A murine 
hybridoma designated 4E1 1 produces a monoclonal antibody that binds the FLAG® peptide in 
the presence of certain divalent metal cations, as described in U.S. Patent 5,01 1,912, hereby 
incorporated by reference. The 4E1 1 hybridoma cell line has been deposited with the American 
Type Culture Collection under accession no. HB 9259. Monoclonal antibodies that bind the 
FLAG* peptide are available from Eastman Kodak Co., Scientific Imaging Systems Division, 
New Haven, Connecticut. 

Among the polypeptides provided herein are variants of native polypeptides that retain 
the native biological activity or the substantial equivalent thereof. One example is a variant that 
binds with essentially the same binding affinity as does the native form. Binding affinity can be 
measured by conventional procedures, e.g., as described in U.S. Patent No. 5,512,457 and as set 
forth below. 

Variants include polypeptides that are substantially homologous to the native form, but 
which have an amino acid sequence different from that of the native form because of one or 
more deletions, insertions or substitutions. Particular embodiments include, but are not limited 
to, polypeptides that comprise from one to ten deletions, insertions or substitutions of amino 
acid residues, when compared to a native sequence. 

A given amino acid may be replaced, for example, by a residue having similar 
physiochemical characteristics. Examples of such conservative substitutions include 
substitution of one aliphatic residue for another, such as He, Val, Leu, or Ala for one another; 
substitutions of one polar residue for another, such as between Lys and Arg, Glu and Asp, or 
Gin and Asn; or substitutions of one aromatic residue for another, such as Phe, Trp, or Tyr for 
one another. Other conservative substitutions, e.g., involving substitutions of entire regions 
having similar hydrophobicity characteristics, are well known. 

Similarly, the DNAs of the invention include variants that differ from a native DNA 
sequence because of one or more deletions, insertions or substitutions, but that encode a 
biologically active polypeptide. 

The invention further includes polypeptides of the invention with or without associated 
native-pattern glycosylation. Polypeptides expressed in yeast or mammalian expression systems 
(e.g., COS-l or COS-7 cells) can be similar to or significantly different from a native 
polypeptide in molecular weight and glycosylation pattern, depending upon the choice of 
expression system. Expression of polypeptides of the invention in bacterial expression systems. 
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such as E, coli. provides non-glycosylated molecules. Further, a given preparation may include 
multiple differentially glycosylated species of the protein. Glycosyl groups can be removed 
through conventional methods, in particular those utilizing glycopeptidase. In general, 
glycosylated polypeptides of the invention can be incubated with a molar excess of 
glycopeptidase (Boehringer Mannheim). 

Correspondingly, similar DNA constructs that encode various additions or substitutions 
of amino acid residues or sequences, or deletions of terminal or internal residues or sequences 
are encompassed by the invention. For example, N-glycosylation sites in the polypeptide 
extracellular domain can be modified to preclude glycosylation, allowing expression of a 
reduced carbohydrate analog in mammalian and yeast expression systems. N-glycosylation sites 
in eukaryotic polypeptides are characterized by an amino acid triplet Asn-X-Y, wherein X is any 
amino acid except Pro and Y is Ser or Thr. Appropriate substitutions, additions, or deletions to 
the nucleotide sequence encoding these triplets will result in prevention of attachment of 
carbohydrate residues at the Asn side chain. Alteration of a single nucleotide, chosen so that 
Asn is replaced by a different amino acid, for example, is sufficient to inactivate an N- 
glycosylation site. Alternatively, the Ser or Thr can by replaced with another amino acid, such 
as Ala. Known procedures for inactivating N-glycosylation sites in proteins include those 
described in U.S. Patent 5,071,972 and EP 276,846, hereby incorporated by reference. 

In another example of variants, sequences encoding Cys residues that are not essential 
for biological activity can be altered to cause the Cys residues to be deleted or replaced with 
other amino acids, preventing formation of incorrect intramolecular disulfide bridges upon 
folding or renaturation. 

Other variants are prepared by modification of adjacent dibasic amino acid residues, to 
enhance expression in yeast systems in which KEX2 protease activity is present. EP 212,914 
discloses the use of site-specific mutagenesis to inactivate KEX2 protease processing sites in a 
protein. KEX2 protease processing sites are inactivated by deleting, adding or substituting 
residues to alter Arg-Arg, Arg-Lys, and Lys-Arg pairs to eliminate the occurrence of these 
adjacent basic residues. Lys-Lys pairings are considerably less susceptible to KEX2 cleavage, 
and conversion of Arg-Lys or Lys-Arg to Lys-Lys represents a conservative and preferred 
approach to inactivating KEX2 sites. 
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Oligomers 

Encompassed by the invention are oligomers or fusion proteins that contain IMX 
polypeptides. Such oligomers may be in the form of covalently-linked or non-covalentlv-linked 
multimers, including dimers, trimers, or higher oligomers. As noted above, preferred 
polypeptides are soluble and thus these oligomers may comprise soluble polypeptides. In one 
aspect of the invention, the oligomers maintain the binding ability of the polypeptide 
components and provide therefor, bivalent, trivalent, etc., binding sites. 

One embodiment of the invention is directed to oligomers comprising multiple 
polypeptides joined via covalent or non-covalent interactions between peptide moieties fused to 
the polypeptides. Such peptides may be peptide linkers (spacers), or peptides that have the 
property of promoting oligomerization. Leucine zippers and certain polypeptides derived from 
antibodies are among the peptides that can promote oligomerization of the polypeptides attached 
thereto, as described in more detail below. 

Immunoglobulin-based Oligomers 
As one alternative, an oligomer is prepared using polypeptides derived from 
immunoglobulins. Preparation of fusion proteins comprising certain heterologous polypeptides 
fused to various portions of antibody-derived polypeptides (including the Fc domain) has been 
described, e.g., by Ashkenazi et aL (PNAS USA 88:10535, 1991); Byrn et al. (Nature 344:677, 
1990); and Hollenbaugh and Aruffo ("Construction of Immunoglobulin Fusion Proteins", in 
Current Protocols in Immunology, Suppl. 4, pages 10.19.1 - 10.19.1 1, 1992). 

One embodiment of the present invention is directed to a dimer comprising two fusion 
proteins created by fusing a polypeptide of the invention to an Fc polypeptide derived from an 
antibody. A gene fusion encoding the polypeptide/Fc fusion protein is inserted into an 
appropriate expression vector. Polypeptide/Fc fusion proteins are expressed in host cells 
transformed with the recombinant expression vector, and allowed to assemble much like 
antibody molecules, whereupon interchain disulfide bonds form between the Fc moieties to yield 
divalent molecules. 

The term "Fc polypeptide" as used herein includes native and mutein forms of 
polypeptides comprising from the Fc region of an antibody. Truncated forms of such 
polypeptides containing the hinge region that promotes dimerization are also included. 
Preferred polypeptides comprise an Fc polypeptide derived from a human IgGl antibody 
comprising any or all of the CH domains "of the Fc region. 

One suitable Fc polypeptide, described in PCT application WO 93/10151, hereby 
incorporated by reference, is a single chain polypeptide extending from the N-terminal hinse 
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region to the native C-terminus of the Fc region of a human IgGl antibody. Another useful Fc 
polypeptide is the Fc mutein described in U.S. Patent 5,457,035 and in Baum et al., (EMBO J. 
13:3992-4001, 1994) incorporated herein by reference. The ammo acid sequence of this mutein 
is identical to that of the native Fc sequence presented in WO 93/10151, except that amino acid 
1 9 has been changed from Leu to Ala, amino acid 20 has been changed from Leu to GIu, and 
amino acid 22 has been changed from Gly to Ala. The mutein exhibits reduced affinity for Fc 
receptors. 

The above-described fusion proteins comprising Fc moieties (and oligomers formed 
therefrom) offer the advantage of facile purification by affinity chromatography over Protein A 
or Protein G columns. 

In other embodiments, the polypeptides of the invention may be substituted for the 
variable portion of an antibody heavy or light chain. If fusion proteins are made with both 
heavy and light chains of an antibody, it is possible to form an oligomer with as many as four 
soluble regions of the proteins of the invention. 



Peptide-linker Based Oligomers 
Alternatively, the oligomer is a fusion protein comprising multiple polypeptides, with or 
without peptide linkers (spacer peptides). Among the suitable peptide linkers are those 
described in U.S. Patents 4,751,180 and 4,935,233, which are hereby incorporated by reference. 
A DNA sequence encoding a desired peptide linker may be inserted between, and in the same 
reading frame as, the DNA sequences of the invention, using any suitable conventional 
technique. For example, a chemically synthesized oligonucleotide encoding the linker may be 
ligated between the sequences. In particular embodiments, a fusion protein comprises from two 
to four soluble LMX polypeptides, separated by peptide linkers. 

Leucine-Zippers 

Another method for preparing the oligomers of the invention involves use of a leucine 
zipper. Leucine zipper domains are peptides that promote oligomerization of the proteins in 
which they are found. Leucine zippers were originally identified in several DNA-binding 
proteins (Landschulz et al.. Science 240: 1 759, 1988), and have since been found in a variety of 
different proteins. Among the known leucine zippers are naturally occurring peptides and 
derivatives thereof that dimerize or trimerize. 

Examples of leucine zipper domains suitable for producing soluble oligomeric proteins 
are described in PCT application WO 94/10308, and the leucine zipper derived from lung 

-32- 





WO 00/28033 PCT/US99/26788 
surfactant protein D (SPD) described in Hoppe et al. {FEBS Letters 344:191, 1994), hereby 
incorporated by reference. The use of a modified leucine zipper that allows for stable 
trimerization of a heterologous protein fused thereto is described in Fanslow et al. (Semin. 
Immunol. 6:267-278, 1994). Recombinant fusion proteins comprising a soluble polypeptide 
fused to a leucine zipper peptide are expressed in suitable host cells, and the soluble oligomer 
that forms is recovered from the culture supernatant. 

In particular embodiments, leucine residues in a leucine zipper moiety are replaced by 
isoleucine residues. Such peptides comprising isoleucine may be referred to as "isoleucine 
zippers" but are encompassed by the term "leucine zippers" as employed herein. 

PRODUCTION OF POL YPEPTIDES AND FRAGMENTS THEREOF 

Expression, isolation and purification of the polypeptides and fragments of the invention 
may be accomplished by any suitable technique, including but not limited to the following: 

Expression Systems 

The present invention also provides recombinant cloning and expression vectors 
containing DNA, as well as host cell containing the recombinant vectors. Expression vectors 
comprising DNA may be used to prepare the polypeptides or fragments of the invention encoded 
by the DNA. A method for producing polypeptides comprises culturing host cells transformed 
with a recombinant expression vector encoding the polypeptide, under conditions that promote 
expression of the polypeptide, then recovering the expressed polypeptides from the culture. The 
skilled artisan will recognize that the procedure for purifying the expressed polypeptides will 
vary according to such factors as the type of host cells employed, and whether the polypeptide is 
membrane-bound or a soluble form that is secreted from the host cell. 

Any suitable expression system may be employed. The vectors include a DNA encoding 
a polypeptide or fragment of the invention, operably linked to suitable transcriptional or 
translational regulatory nucleotide sequences, such as those derived from a mammalian, 
microbial, viral, or insect gene. Examples of regulatory sequences include transcriptional 
promoters, operators, or enhancers, an mRNA ribosomal binding site, and appropriate sequences 
which control transcription and translation initiation and termination. Nucleotide sequences are 
operably linked when the regulatory sequence functionally relates to the DNA sequence. Thus, 
a promoter nucleotide sequence is operabiy linked to a DNA sequence if the promoter nucleotide 
sequence controls the transcription of the DNA sequence. An origin of replication that confers 
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the ability to replicate in the desired host cells, and a selection gene by which transformants are 
identified, are generally incorporated into the expression vector. 

In addition, a sequence encoding an appropriate signal peptide (native or heterologous) 
can be incorporated into expression vectors. A DNA sequence for a signal peptide (secretory 
leader) may be fused in frame to the nucleic acid sequence of the invention so that the DNA is 
initially transcribed, and the mRNA translated, into a fusion protein comprising the signal 
peptide. A signal peptide that is functional in the intended host cells promotes extracellular 
secretion of the polypeptide. The signal peptide is cleaved from the polypeptide upon secretion 
of polypeptide from the cell. 

The skilled artisan will also recognize that the position(s) at which the signal peptide is 
cleaved may differ from that predicted by computer program, and may vary according to such 
factors as the type of host cells employed in expressing a recombinant polypeptide. A protein 
preparation may include a mixture of protein molecules having different N-terminal amino 
acids, resulting from cleavage of the signal peptide at more than one site. 

Suitable host cells for expression of polypeptides include prokaryotes, yeast or higher 
eukaryotic cells. Mammalian or insect cells are generally preferred for use as host cells. 
Appropriate cloning and expression vectors for use with bacterial, fungal, yeast, and mammalian 
cellular hosts are described, for example, in Pouwels et al. Cloning Vectors: A Laboratory 
Manual. Elsevier, New York, (1985). Cell-free translation systems could also be employed to 
produce polypeptides using RNAs derived from DNA constructs disclosed herein. 

Prokarvotic Systems 

Prokaryotes include gram-negative or gram-positive organisms. Suitable prokaryotic 
host cells for transformation include, for example, E. coli, Bacillus subtilis. Salmonella 
typhimurium, and various other species within the genera Pseudomonas, Streptomyces. and 
Staphylococcus. In a prokaryotic host cell, such as E. coli, a polypeptide may include an N- 
terminal methionine residue to facilitate expression of the recombinant polypeptide in the 
prokaryotic host cell. The N-terminal Met may be cleaved from the expressed recombinant 
polypeptide. 

Expression vectors for use in prokaryotic host cells generally comprise one or more 
phenotypic selectable marker genes. A phenotypic selectable marker gene is, for example, a 
gene encoding a protein that confers antibiotic resistance or that supplies an autotrophic 
requirement. Examples of useful expression vectors for prokaryotic host cells include those 
derived from commercially available plasmids such as the cloning vector pBR322 (ATCC 
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3701 7). pBR322 contains genes for ampicillin and tetracycline resistance and thus provides 
simple means for identifying transformed cells. An appropriate promoter and a DNA sequence 
are inserted into the pBR322 vector. Other commercially available vectors include, for example 
pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and pGEMl (Promega Biotec, 
Madison, WI, USA). 

Promoter sequences commonly used for recombinant prokaryotic host cell expression 
vectors include p-lactamase (penicillinase), lactose promoter system (Chang et ah, Nature 
275:6X5, 1978; and Goeddel et al., Nature 281:544, 1979), tryptophan (trp) promoter system 
(Goeddel et al., Nucl. Acids Res. <?:4057, 1980; and EP-A-36776) and tac promoter (Maniatis, 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, p. 412, 1982). A 
particularly useful prokaryotic host cell expression system employs a phage kP L promoter and a 
cI857ts thermolabile repressor sequence. Plasmid vectors available from the American Type 
Culture Collection which incorporate derivatives of the XP L promoter include plasmid pHUB2 
(resident in E. coli strain JMB9, ATCC 37092) and P PLc28 (resident in E. coli RR1, ATCC 
53082). 



Yeast Systems 

Alternatively, the polypeptides may be expressed in yeast host cells, preferably from the 
Saccharomyces genus (e.g., S. cerevisiae). Other genera of yeast, such as Pichia or 
Kluyveromyces, may also be employed. Yeast vectors will often contain an origin of replication 
sequence from a 2u yeast plasmid, an autonomously replicating sequence (ARS), a promoter 
region, sequences for polyadenylation, sequences for transcription termination, and a selectable 
marker gene. Suitable promoter sequences for yeast vectors include, among others, promoters 
for metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255:2073, 1980) 
or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 7:149, 1968; and Holland et al., 
Biochem. 77:4900, 1978), such as enolase, glyceraldehyde-3-phosphate dehydrogenase, 
hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3- 
phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phospho-glucose 
isomerase, and glucokinase. Other suitable vectors and promoters for use in yeast expression 
are further described in Hitzeman, EPA-73,657. Another alternative is the glucose-repressible 
ADH2 promoter described by Russell et al. (J. Biol. Chem. 258:2614, 1 982) and Beier et al. 
(Nature 300:724, 1982). Shuttle vectors replicable in both yeast and E. coli may be constructed 
by inserting DNA sequences from pBR322 for selection and replication in E. coli (Amp r gene 
and origin of replication) into the above-described yeast vectors. 
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The yeast oc-factor leader sequence may be employed to direct secretion of the 
polypeptide. The a- factor leader sequence is often inserted between the promoter sequence and 
the structural gene sequence. See, e.g., Kurjan et al., Cell 30:923, 1982 and Bitter et al., Proc. 
Natl. Acad. Sci. USA 57:5330, 1984. Other leader sequences suitable for facilitating secretion of 
recombinant polypeptides from yeast hosts are known to those of skill in the art. A leader 
sequence may be modified near its 3' end to contain one or more restriction sites. This will 
facilitate fusion of the leader sequence to the structural gene. 

Yeast transformation protocols are known to those of skill in the art. One such protocol 
is described by Hinnen et al., Proc. Natl. Acad. Sci. USA 75:1929, 1978. The Hinnen et al. 
protocol selects for Trp + transformants in a selective medium, wherein the selective medium 
consists of 0.67% yeast nitrogen base, 0.5% casamino acids, 2% glucose, 10 mg/ml adenine and 
20 mg/ml uracil. 

Yeast host cells transformed by vectors containing an ADH2 promoter sequence may be 
grown for inducing expression in a "rich" medium. An example of a rich medium is one 
consisting of 1% yeast extract, 2% peptone, and 1% glucose supplemented with 80 mg/ml 
adenine and 80 mg/ml uracil. Derepression of the ADH2 promoter occurs when glucose is 
exhausted from the medium. 



Mammalian or Insect Systems 
Mammalian or insect host cell culture systems also may be employed to express 
recombinant polypeptides. Bacculovirus systems for production of heterologous proteins in 
insect cells are reviewed by Luckow and Summers, Bio/Technology 6:47 (1988). Established 
cell lines of mammalian origin also may be employed. Examples of suitable mammalian host 
cell lines include the COS-7 line of monkey kidney cells (ATCC CRL 1651) (Gluzman et al., 
Cell 23:175, 1981), L cells, C127 cells, 3T3 cells (ATCC CCL 163), Chinese hamster ovary 
(CHO) cells, HeLa cells, and BHK (ATCC CRL 10) cell lines, and the CV1/EBNA cell line 
derived from the African green monkey kidney cell line CV1 (ATCC CCL 70) as described by 
McMahan et al. (EMBOJ. 10: 2821, 1991). 

Established methods for introducing DNA into mammalian cells have been described 
(Kaufman, R.J., Large Scale Mammalian Cell Culture, 1990, pp. 15-69). Additional protocols 
using commercially available reagents, such as Lipofectamine lipid reagent (Gibco/BRL) or 
Lipofectamine-Plus lipid reagent, can be used to transfect cells (Feigner et al., Proc. Natl. Acad. 
Set. USA 54:7413-7417, 1987). In addition, electroporation can be used to transfect mammalian 
cells using conventional procedures, such as those in Sambrook et al. (Molecular Cloning: A 
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Laboratory Manual. 2 ed. Vol. 1-3, Cold Spring Harbor Laboratory Press, 1989). Selection of 
stable transformants can be performed using methods known in the art, such as, for example, 
resistance to cytotoxic drugs. Kaufman et al., Meth. in Ensymology J 85:487-51 1, 1990, 
describes several selection schemes, such as dihydrofolate reductase (DHFR) resistance. A 
suitable host strain for DHFR selection can be CHO strain DX-B1 1, which is deficient in DHFR 
(Urlaub and Chasin, Proc. Natl. Acad. Sci. USA 77:42 1 6-4220, 1 980). A plasmid expressing 
the DHFR cDNA can be introduced into strain DX-B1 1 , and only cells that contain the plasmid 
can grow in the appropriate selective media. Other examples of selectable markers that can be 
incorporated into an expression vector include cDNAs conferring resistance to antibiotics, such 
as G418 and hygromycin B. Cells harboring the vector can be selected on the basis of resistance 
to these compounds. 

Transcriptional and translational control sequences for mammalian host cell expression 
vectors can be excised from viral genomes. Commonly used promoter sequences and enhancer 
sequences are derived from polyoma virus, adenovirus 2, simian virus 40 (SV40), and human 
cytomegalovirus. DNA sequences derived from the SV40 viral genome, for example, SV40 
origin, early and late promoter, enhancer, splice, and polyadenylation sites can be used to 
provide other genetic elements for expression of a structural gene sequence in a mammalian host 
cell. Viral early and late promoters are particularly useful because both are easily obtained from 
a viral genome as a fragment, which can also contain a viral origin of replication (Fiers et al., 
Nature 273: 113,1 978; Kaufman, Meth. in Enzymology, 1 990). Smaller or larger SV40 
fragments can also be used, provided the approximately 250 bp sequence extending from the 
Hind III site toward the Bgl I site located in the SV40 viral origin of replication site is included. 

Additional control sequences shown to improve expression of heterologous genes from 
mammalian expression vectors include such elements as the expression augmenting sequence 
element (EASE) derived from CHO cells (Morris et al., Animal Cell Technology, 1997, pp. 529- 
534 and PCT Application WO 97/25420) and the tripartite leader (TPL) and VA gene RNAs 
from Adenovirus 2 (Gingeras et al., J. Biol. Chem. 257:13475-13491, 1982). The internal 
ribosome entry site (IRES) sequences of viral origin allows dicistronic mRNAs to be translated 
efficiently (Oh and Sarnow, Current Opinion in Genetics and Development J:295-300, 1993; 
Ramesh et al., Nucleic Acids Research 24:2697-2700, 1996). Expression of a heterologous 
cDNA as pan of a dicistronic mRNA followed by the gene for a selectable marker (e.g. DHFR) 
has been shown to improve transfectability of the host and expression of the heterologous cDNA 
(Kaufman, Meth. in Enzymology, 1990). Exemplary expression vectors that employ dicistronic 
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mRNAs are pTR-DC/GFP described by Mosser et ah, Biotechniques 22:150-161, 1997. and 
p2A5I described by Morris et al.. Animal Cell Technology, 1997, pp. 529-534. 

A useful high expression vector, pCAVNOT, has been described by Mosley et al., Cell 
59:335-348, 1989. Other expression vectors for use in mammalian host cells can be constructed 
as disclosed by Okayama and Berg (Mol. Cell. Biol. 5:280, 1983). A useful system for stable 
high level expression of mammalian cDNAs in C 127 murine mammary epithelial cells can be 
constructed substantially as described by Cosman et al. (Mol. Immunol. 25:935, 1986). A useful 
high expression vector, PMLSV N1/N4, described by Cosman et al., Nature 5/2:768, 1984, has 
been deposited as ATCC 39890. Additional useful mammalian expression vectors are described 
in EP-A-0367566, and in WO 93/18982, incorporated by reference herein. In yet another 
alternative, the vectors can be derived from retroviruses. 

Another useful expression vector, pFLAG®, can be used. FLAG 8 ' technology is centered 
on the fusion of a low molecular weight (1 kD), hydrophilic. FLAG® marker peptide to the N- 
terminus of a recombinant protein expressed by pFLAG® expression vectors. 

Regarding signal peptides that may be employed, the native signal peptide may be 
replaced by a heterologous signal peptide or leader sequence, if desired. The choice of signal 
peptide or leader may depend on factors such as the type of host cells in which the recombinant 
polypeptide is to be produced. To illustrate, examples of heterologous signal peptides that are 
functional in mammalian host cells include the signal sequence for interleukin-7 (IL-7) 
described in United States Patent 4,965,195, the signal sequence for interleukin-2 receptor 
described in Cosman et al., Nature 312:768 (1984); the interleukin-4 receptor signal peptide 
described in EP 367,566; the type I interleukin-1 receptor signal peptide described in U.S. Patent 
4,968,607; and the type II interleukin-1 receptor signal peptide described in EP 460.846. 
Purification 

The invention also includes methods of isolating and purifying the polypeptides and 
fragments thereof. 
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Isolation and Purification 
The "isolated" polypeptides or fragments thereof encompassed by this invention are 
polypeptides or fragments that are not in an environment identical to an environment in which it 
or they can be found in nature. The "purified" polypeptides or fragments thereof encompassed 
by this invention are essentially free of association with other proteins or polypeptides, for 
example, as a purification product of recombinant expression systems such as those described 
above or as a purified product from a non-recombinant source such as naturally occurring cells 
and/or tissues. 

In one preferred embodiment, the purification of recombinant polypeptides or fragments 
can be accomplished using fusions of polypeptides or fragments of the invention to another 
polypeptide to aid in the purification of polypeptides or fragments of the invention. Such fusion 
partners can include the poly-His or other antigenic identification peptides described above as 
well as the Fc moieties described previously. 

With respect to any type of host cell, as is known to the skilled artisan, procedures for 
purifying a recombinant polypeptide or fragment will vary according to such factors as the type 
of host cells employed and whether or not the recombinant polypeptide or fragment is secreted 
into the culture medium. 

In general, the recombinant polypeptide or fragment can be isolated from the host cells if 
not secreted, or from the medium or supernatant if soluble and secreted, followed by one or 
more concentration, salting-out, ion exchange, hydrophobic interaction, affinity purification or 
size exclusion chromatography steps. As to specific ways to accomplish these steps, the culture 
medium first can be concentrated using a commercially available protein concentration filter, for 
example, an Amicon or Millipore Pellicon ultrafiltration unit. Following the concentration step, 
the concentrate can be applied to a purification matrix such as a gel filtration medium. 
Alternatively, an anion exchange resin can be employed, for example, a matrix or substrate 
having pendant diethylaminoethyl (DEAE) groups. The matrices can be acrylamide, agarose, 
dextran, cellulose or other types commonly employed in protein purification. Alternatively, a 
cation exchange step can be employed. Suitable cation exchangers include various insoluble 
matrices comprising sulfopropyl or carboxymethyl groups. In addition, a chromatofocusing step 
can be employed. Alternatively, a hydrophobic interaction chromatography step can be 
employed. Suitable matrices can be phenyl or octyl moieties bound to resins. In addition, 
affinity chromatography with a matrix which selectively binds the recombinant protein can be 
employed. Examples of such resins employed are lectin columns, dye columns, and metal- 
chelating columns. Finally, one or more reverse-phase high performance liquid chromatography 
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(RP-HPLC) steps employing hydrophobic RP-HPLC media, (e.g.. silica gel or polymer resin 
having pendant methyl, octyl, octyldecyl or other aliphatic groups) can be employed to further 
purify the polypeptides. Some or all of the foregoing purification steps, in various 
combinations, are well known and can be employed to provide an isolated and purified 
recombinant protein. 

It is also possible to utilize an affinity column comprising a polypeptide-binding protein 
of the invention, such as a monoclonal antibody generated against polypeptides of the invention, 
to affinity-purify expressed polypeptides. These polypeptides can be removed from an affinity 
column using conventional techniques, e.g., in a high salt elution buffer and then dialyzed into a 
lower salt buffer for use or by changing pH or other components depending on the affinity 
matrix utilized, or be competitively removed using the naturally occurring substrate of the 
affinity moiety, such as a polypeptide derived from the invention. 

In this aspect of the invention, polypeptide-binding proteins, such as the anti-polypeptide 
antibodies of the invention or other proteins that may interact with the polypeptide of the 
invention, can be bound to a solid phase support such as a column chromatography matrix or a 
similar substrate suitable for identifying, separating, or purifying cells that express polypeptides 
of the invention on their surface. Adherence of polypeptide-binding proteins of the invention to 
a solid phase contacting surface can be accomplished by any means. For example, magnetic 
microspheres can be coated with these polypeptide-binding proteins and held in the incubation 
vessel through a magnetic field. Suspensions of cell mixtures are contacted with the solid phase 
that has such polypeptide-binding proteins thereon. Cells having polypeptides of the invention 
on their surface bind to the fixed polypeptide-binding protein and unbound cells then are washed 
away. This affinity-binding method is useful for purifying, screening, or separating such 
polypeptide-expressing cells from solution. Methods of releasing positively selected cells from 
the solid phase are known in the art and encompass, for example, the use of enzymes. Such 
enzymes are preferably non-toxic and non-injurious to the cells and are preferably directed to 
cleaving the cell-surface binding partner. 

Alternatively, mixtures of cells suspected of containing polypeptide-expressing cells of 
the invention first can be incubated with a biotinylated polypeptide-binding protein of the 
invention. Incubation periods are typically at least one hour in duration to ensure sufficient 
binding to polypeptides of the invention. The resulting mixture then is passed through a column 
packed with avidin-coated beads, whereby the high affinity of biotin for avidin provides the 
binding of the polypeptide-binding cells to the beads. Use of avidin-coated beads is known in 
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the an. See Berenson. et al. J. Cell. Biochem., 10D:239 (1986). Wash of unbound material and 
the release of the bound cells is performed using conventional methods. 

The desired degree of purity depends on the intended use of the protein. A relatively 
high degree of purity is desired when the polypeptide is to be administered in vivo, for example. 
In such a case, the polypeptides are purified such that no protein bands corresponding to other 
proteins are detectable upon analysis by SDS-polyacrylamide gel electrophoresis (SDS-PAGE). 
It will be recognized by one skilled in the pertinent field that multiple bands corresponding to 
the polypeptide may be visualized by SDS-PAGE, due to differential glycosylation, differential 
post-translational processing, and the like. Most preferably, the polypeptide of the invention is 
purified to substantial homogeneity, as indicated by a single protein band upon analysis by SDS- 
PAGE. The protein band may be visualized by silver staining, Coomassie blue staining, or (if 
the protein is radiolabeled) by autoradiography. 

Assays 

The purified polypeptides of the invention (including proteins, polypeptides, fragments, 
variants, oligomers, and other forms) may be tested for the ability to bind a cognate, ligand, 
receptor, substrate, or counter-structure and the like ("binding partner") in any suitable assay, 
such as a conventional binding assay. To illustrate, the polypeptide may be labeled with a 
detectable reagent (e.g., a radionuclide, chromophore, enzyme that catalyzes a colorimetric or 
fluorometric reaction, and the like). The labeled polypeptide is contacted with cells expressing 
the binding partner. The cells then are washed to remove unbound labeled polypeptide, and the 
presence of cell-bound label is determined by a suitable technique, chosen according to the 
nature of the label. 

One example of a binding assay procedure is as follows. A recombinant expression 
vector containing the binding partner cDNA is constructed using methods known in the art. 
CV1-EBNA-1 cells in 10 cm 2 dishes are transfected with the recombinant expression vector. 
CV-l/EBNA-1 cells (ATCC CRL 10478) constitutively express EBV nuclear antigen-1 driven 
from the CMV immediate-early enhancer/promoter. CV1-EBNA-1 was derived from the 
African Green Monkey kidney cell line CV-1 (ATCC CCL 70), as described by McMahan et al. 
(EMBOJ. 10:2821, 1991). 

The transfected cells are cultured for 24 hours, and the cells in each dish then are split 
into a 24-weII plate. After culturing an additional 48 hours, the transfected cells (about 4 x 10 4 
cells/well) are washed with BM-NFDM, which is binding medium (RPMI 1640 containing 25 
mg/ml bovine serum albumin, 2 mg/ml sodium azide, 20 mM Hepes pH 7.2) to which 50 mg/ml 
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nonfat dry milk has been added. The cells then are incubated for 1 hour at 37°C with various 
concentrations of. for example, a soluble polypeptide/Fc fusion protein made as set forth above. 
Cells then are washed and incubated with a constant saturating concentration of a ,25 I-mouse 
anti-human IgG in binding medium, with gentle agitation for 1 hour at 37°C. After extensive 
washing, cells are released via trypsinization. 

The mouse anti-human IgG employed above is directed against the Fc region of human 
IgG and can be obtained from Jackson Immunoresearch Laboratories, Inc., West Grove, PA. 
The antibody is radioiodinated using the standard chloramine-T method. The antibody will bind 
to the Fc portion of any polypeptide/Fc protein that has bound to the cells. In all assays, non- 
specific binding of ,25 I-antibody is assayed in the absence of the Fc fusion protein/Fc, as well as 
in the presence of the Fc fusion protein and a 200- fold molar excess of unlabeled mouse anti- 
human IgG antibody. 

Cell-bound ' I-antibody is quantified on a Packard Autogamma counter. Affinity 

calculations (Scatchard, Ann. N. Y Acad. Sci. 51 :660, 1949) are generated on RS/1 (BBN 

Software, Boston, MA) run on a Microvax computer. 

Another type of suitable binding assay is a competitive binding assay. To illustrate, 

biological activity of a variant may be determined by assaying for the variant's ability to 

compete with the native protein for binding to the binding partner. 

Competitive binding assays can be performed by conventional methodology. Reagents 

that may be employed in competitive binding assays include radiolabeled IMX polypeptides and 

intact cells expressing the IMX polypeptide (endogenous or recombinant) on the cell surface. 

For example, a radiolabeled soluble IMX polypeptide fragment can be used to compete with a 
soluble IMX polypeptide variant for binding to the cell surface binding partner. Instead of intact 
cells, one could substitute a soluble binding partner/Fc fusion protein bound to a solid phase 
through the interaction of Protein A or Protein G (on the solid phase) with the Fc moiety. 
Chromatography columns that contain Protein A and Protein G include those available from 
Pharmacia Biotech, Inc., Piscataway, NJ. 

Another type of competitive binding assay utilizes the radiolabeled soluble binding 
partner, such as a soluble binding partner/Fc fusion protein, and intact cells expressing the IMX 
polypeptide. Qualitative results can be obtained by competitive autoradiographic plate binding 
assays, while Scatchard plots (Scatchard, Ann. N. Y Acad. Sci. 5 1 :660, 1 949) may be utilized to 
generate quantitative results. 
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USE OF IMX NUCLEIC ACIDS OR OLIGONUCLEOTIDES 

In addition to being used to express polypeptides as described above, the nucleic acids of 
the invention, including DNA ? and oligonucleotides thereof can be used; 

_ as probes to identify nucleic acid encoding proteins homologous to IMX 
polypeptides; 

to identify human chromosomes; 

_ to map genes on human chromosome numbers 7, 19, and 22; 

_ to identify genes associated with certain diseases, syndromes, or other 
conditions associated with human chromosome numbers 7, 19, and 22; 

_ as single-stranded sense or antisense oligonucleotides, to inhibit expression of 
polypeptide encoded by the EMX sequences; 

_ to help detect defective genes in an individual; and 
_ for gene therapy. 

Probes 

Among the uses of nucleic acids of the invention is the use of fragments as probes or 
primers. Such fragments generally comprise at least about 1 7 contiguous nucleotides of a DNA 
sequence. In other embodiments, a DNA fragment comprises at least 30 ? or at least 60, 
contiguous nucleotides of a DNA sequence. 

Because homologs of SEQ ID NOs;l-26, from other mammalian species, are 
contemplated herein, probes based on the human DNA sequence of SEQ ID NOs;l-26 may be 
used to screen cDNA libraries derived from other mammalian species, using conventional cross- 
species hybridization techniques. 

Using knowledge of the genetic code in combination with the amino acid sequences set 
forth above, sets of degenerate oligonucleotides can be prepared. Such oligonucleotides are 
useful as primers, e.g., in polymerase chain reactions (PCR), whereby DNA fragments are 
isolated and amplified. 
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Identifying Chromosome Number 

All or a portion of the nucleic acids of SEQ ID NOs:l-26, including oligonucleotides, 
can be used by those skilled in the art using well-known techniques to identify the human 
chromosomes, and the specific locus thereof, that contain the DNA of IMX family members. 
Useful techniques include, but are not limited to, using the sequence or portions, including 
oligonucleotides, as a probe in various well-known techniques such as in situ hybridization to 
chromosome spreads, Southern blot hybridization to hybrid cell lines, fluorescent tagging, and 
radiation hybrid mapping. 

For example, chromosomes can be mapped by radiation hybridization. 
PCR is performed using the Whitehead Institute/MIT Center for Genome Research Genebridge4 
panel of 93 radiation hybrids (http://www-genome.wi.mit.edu/ftp/distribution/ 
human_STS_releases/juIy97/rhmap/genebridge4.html). Primers are used which lie within a 
putative exon of the gene of interest and which amplify a product from human genomic DNA, 
but do not amplify hamster genomic DNA. The results of the PCRs are converted into a data 
vector that is submitted to the Whitehead/MIT Radiation Mapping site on the internet 
(http://www-seq.wi.mit.edu). The data is scored and the chromosomal assignment and 
placement relative to known Sequence Tag Site (STS) markers on the radiation hybrid map is 
provided. The following web site provides additional information about radiation hybrid 

mapping: http://www-genome.wi.mit.edu/ftp/distribution/human_STS_releases/july97/ 
07-97.INTRO.html). 



Identifying Associated Diseases 

As described previously, IMX molecules numbered 4, 21, 44, and 56 have been mapped 
to particular chromosome locations. Thus, the nucleic acid of a particular IMX molecule or a 
fragment thereof can be used by one skilled in the art using well-known techniques to analyze 
abnormalities associated with gene mapping to such chromosomes. This enables one to 
distinguish conditions in which this marker is rearranged or deleted. In addition, nucleotides of 
such IMX molecules or fragments thereof can be used as a positional marker to map other genes 
of previously unknown location. 

The DNA may be used in developing treatments for any disorder mediated (directly or 
indirectly) by defective, or insufficient amounts of, the genes corresponding to the nucleic acids 
of the invention. Disclosure herein of native nucleotide sequences permits the detection of 
defective genes, and the replacement thereof with normal genes. Defective genes may be 
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detected in in vitro diagnostic assays, and by comparison of a native nucleotide sequence 
disclosed herein with that of a gene derived from a person suspected of harboring a defect in this 
gene. 



Sense- Anti sense 

Other useful fragments of the nucleic acids include antisense or sense oligonucleotides 
comprising a single-stranded nucleic acid sequence (either RNA or DNA) capable of binding to 
target mRNA (sense) or DNA (antisense) sequences. Antisense or sense oligonucleotides 
according to the present invention comprise a fragment of DNA (SEQ ID NOs:l-26). Such a 
fragment generally comprises at least about 14 nucleotides, preferably from about 14 to about 30 
nucleotides. The ability to derive an antisense or a sense oligonucleotide, based upon a cDNA 
sequence encoding a given protein is described in, for example, Stein and Cohen {Cancer Res. 
48:2659, 1988) and van der Krol et al. (BioTechniques 6:958, 1988). 

Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in 
the formation of duplexes that block or inhibit protein expression by one of several means, 
including enhanced degradation of the mRNA by RNAseH, inhibition of splicing, premature 
termination of transcription or translation, or by other means. The antisense oligonucleotides 
thus may be used to block expression of proteins. Antisense or sense oligonucleotides further 
comprise oligonucleotides having modified sugar-phosphodiester backbones (or other sugar 
linkages, such as those described in W09 1/06629) and wherein such sugar linkages are resistant 
to endogenous nucleases. Such oligonucleotides with resistant sugar linkages are stable in vivo 
(i.e., capable of resisting enzymatic degradation) but retain sequence specificity to be able to 
bind to target nucleotide sequences. 

Other examples of sense or antisense oligonucleotides include those oligonucleotides 
which are covalently linked to organic moieties, such as those described in WO 90/10448, and 
other moieties that increases affinity of the oligonucleotide for a target nucleic acid sequence, 
such as poly-(L-lysine). Further still, intercalating agents, such as ellipticine, and alkylating 
agents or metal complexes may be attached to sense or antisense oligonucleotides to modify 
binding specificities of the antisense or sense oligonucleotide for the target nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing the target 
nucleic acid sequence by any gene transfer method, including, for example, lipofection, CaPC- 
mediated DNA transfection, electroporation, or by using gene transfer vectors such as Epstein- 
Barr virus. 
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Sense or antisense oligonucleotides also may be introduced into a cell containing the 
target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as 
described in WO 91/04753. Suitable ligand binding molecules include, but are not limited to, 
cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell surface 
receptors. Preferably, conjugation of the ligand binding molecule does not substantially 
interfere with the ability of the ligand binding molecule to bind to its corresponding molecule or 
receptor, or block entry of the sense or antisense oligonucleotide or its conjugated version into 
the cell. 

Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell 
containing the target nucleic acid sequence by formation of an oligonucleotide-lipid complex, as 
described in WO 90/10448. The sense or antisense oligonucleotide-lipid complex is preferably 
dissociated within the cell by an endogenous lipase. 

USE OF IMX POLYPEPTIDES AND FRAGMENTED POLYPEPTIDES 

Uses include, but are not limited to, the following: 

Purifying proteins and measuring activity thereof 
Delivery Agents 

Therapeutic and Research Reagents 

Molecular weight and Isoelectric focusing markers 

Controls for peptide fragmentation 

Identification of unknown proteins 

Preparation of Antibodies 

Purification Reagents 

Each of the polypeptides of the invention finds use as a protein purification reagent. The 
polypeptides may be attached to a solid support material and used to purify (binding partner) 
proteins by affinity chromatography. In particular embodiments, a polypeptide (in any form 
described herein that is capable of binding (binding partner)) is attached to a solid support by 
conventional procedures. As one example, chromatography columns containing functional 
groups that will react with functional groups on amino acid side chains of proteins are available 
(Pharmacia Biotech, Inc., Piscataway, NJ). In an alternative, a polypeptide/Fc protein (as 
discussed above) is attached to Protein A- or Protein G-containing chromatography columns 
through interaction with the Fc moiety. 
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The polypeptide also finds use in purifying or identifying cells that express (binding 
partner) on the cell surface. Polypeptides are bound to a solid phase such as a column 
chromatography matrix or a similar suitable substrate. For example, magnetic microspheres can 
be coated with the polypeptides and held in an incubation vessel through a magnetic field. 
Suspensions of cell mixtures containing (binding partner) expressing cells are contacted with the 
solid phase having the polypeptides thereon. Cells expressing (binding partner) on the cell 
surface bind to the fixed polypeptides, and unbound cells then are washed away. 

Alternatively, the polypeptides can be conjugated to a detectable moiety, then incubated 
with cells to be tested for (binding partner) expression. After incubation, unbound labeled 
matter is removed and the presence or absence of the detectable moiety on the cells is 
determined. 

In a further alternative, mixtures of cells suspected of containing (binding partner) cells 
are incubated with biotinylated polypeptides. Incubation periods are typically at least one hour 
in duration to ensure sufficient binding. The resulting mixture then is passed through a column 
packed with avidin-coated beads, whereby the high affinity of biotin for avidin provides binding 
of the desired cells to the beads. Procedures for using avidin-coated beads are known (see 
Berenson, et al. J. Cell. Biochem., 10D.239, 1986). Washing to remove unbound material, and 
the release of the bound cells, are performed using conventional methods. 

Measuring Activity 

Polypeptides also find use in measuring the biological activity of (binding partner) 
protein in terms of their binding affinity. The polypeptides thus may be employed by those 
conducting "quality assurance" studies, e.g., to monitor shelf life and stability of protein under 
different conditions. For example, the polypeptides may be employed in a binding affinity study 
to measure the biological activity of a (binding partner) protein that has been stored at different 
temperatures, or produced in different cell types. The proteins also may be used to determine 
whether biological activity is retained after modification of a (binding partner) protein (e.g., 
chemical modification, truncation, mutation, etc.). The binding affinity of the modified (binding 
partner) protein is compared to that of an unmodified (binding partner) protein to detect any 
adverse impact of the modifications on biological activity of (binding partner). The biological 
activity of a (binding partner) protein thus can be ascertained before it is used in a research 
study, for example. 
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Delivery Agents 

The polypeptides also find use as carriers for delivering agents attached thereto to cells 
bearing binding partner. Cells expressing (binding partner) include those identified in (add 
citation if reference known). The polypeptides thus can be used to deliver diagnostic or 
therapeutic agents to such cells (or to other cell types found to express (binding partner) on the 
cell surface) in in vitro or in vivo procedures. 

Detectable (diagnostic) and therapeutic agents that may be attached to a polypeptide 
include, but are not limited to, toxins, other cytotoxic agents, drugs, radionuclides, 
chromophores, enzymes that catalyze a colorimetric or fluorometric reaction, and the like, with 
the particular agent being chosen according to the intended application. Among the toxins are 
ncin, abrin, diphtheria toxin, Pseudomonas aeruginosa exotoxin A, ribosomal inactivating 
proteins, mycotoxins such as trichothecenes, and derivatives and fragments (e.g., single chains) 
thereof. Radionuclides suitable for diagnostic use include, but are not limited to, l23 I, 13, I, 99m Tc, 
"in, and 76 Br. Examples of radionuclides suitable for therapeutic use are UI I 211 At 77 Br l86 Re 
,88 Re, 212 Pb, 2,2 Bi, ,09 Pd, M Cu, and 67 Cu. 

Such agents may be attached to the polypeptide by any suitable conventional procedure. 
The polypeptide comprises functional groups on amino acid side chains that can be reacted with 
functional groups on a desired agent to form covalent bonds, for example. Alternatively, the 
protein or agent may be derivatized to generate or attach a desired reactive functional group. 
The derealization may involve attachment of one of the bi functional coupling reagents available 
for attaching various molecules to proteins (Pierce Chemical Company, Rockford, Illinois). A 
number of techniques for radiolabeling proteins are known. Radionuclide metals may be 
attached to polypeptides by using a suitable bifunctional chelating agent, for example. 

Conjugates comprising polypeptides and a suitable diagnostic or therapeutic agent 
(preferably covalently linked) are thus prepared. The conjugates are administered or otherwise 
employed in an amount appropriate for the particular application. 

Therapeutic Agents 

Polypeptides of the invention may be used in developing treatments for any disorder 
mediated (directly or indirectly) by defective, or insufficient amounts of the polypeptides. These 
polypeptides may be administered to a mammal afflicted with such a disorder. 

The polypeptides may also be employed in inhibiting a biological activity of (binding 
partner), in in vitro or in vivo procedures. For example, a purified polypeptide may be used to 
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inhibit binding of (binding partner) to endogenous cell surface (binding partner). Biological 
effects that result from the binding of (binding partner) to endogenous receptors thus are 
inhibited. 

IMX polypeptides may be administered to a mammal to treat a (binding partner- 
mediated disorder. Such (binding partner)-mediated disorders include conditions caused 
(directly or indirectly) or exacerbated by (binding partner). 

Compositions of the present invention may contain a polypeptide in any form described 
herein, such as native proteins, variants, derivatives, oligomers, and biologically active 
fragments. In particular embodiments, the composition comprises a soluble polypeptide or an 
oligomer comprising soluble polypeptides. 

Compositions comprising an effective amount of a polypeptide of the present invention, 
in combination with other components such as a physiologically acceptable diluent, carrier, or 
excipient, are provided herein. The polypeptides can be formulated according to known 
methods used to prepare pharmaceutically useful compositions. They can be combined in 
admixture, either as the sole active material or with other known active materials suitable for a 
given indication, with pharmaceutically acceptable diluents (e.g., saline, Tris-HCl, acetate, and 
phosphate buffered solutions), preservatives (e.g., thimerosal, benzyl alcohol, parabens), 
emulsifiers, solubilizers, adjuvants and/or carriers. Suitable formulations for pharmaceutical 
compositions include those described in Remington's Pharmaceutical Sciences, 16th ed. 1980, 
Mack Publishing Company, Easton, PA. 

In addition, such compositions can be complexed with polyethylene glycol (PEG), metal 
ions, or incorporated into polymeric compounds such as polyacetic acid, polyglycolic acid, 
hydrogels, dextran, etc.. or incorporated into liposomes, microemulsions, micelles, unilamellar 
or multilamellar vesicles, erythrocyte ghosts or spheroblasts. Such compositions will influence 
the physical state, solubility, stability, rate of in vivo release, and rate of in vivo clearance, and 
are thus chosen according to the intended application. 

The compositions of the invention can be administered in any suitable manner, e.g., 
topically, parenterally, or by inhalation. The term "parenteral" includes injection, e.g., by 
subcutaneous, intravenous, or intramuscular routes, also including localized administration, e.g., 
at a site of disease or injury. Sustained release from implants is also contemplated. One skilled 
in the pertinent art will recognize that suitable dosages will vary, depending upon such factors as 
the nature of the disorder to be treated, the patient's body weight, age, and general condition, and 
the route of administration. Preliminary doses can be determined according to animal tests, and 
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the scaling of dosages for human administration is performed according to art-accepted 
practices. 

Compositions comprising nucleic acids in physiologically acceptable formulations are 
also contemplated. DNA may be formulated for injection, for example. 

Research Agents 

Another use of the polypeptide of the present invention is as a research tool for studying 
the biological effects that result from inhibiting interactions between polypeptides of the 
invention and their "binding partner". Polypeptides also may be employed in in vitro assays for 
detecting binding partner or cells expressing the binding partner or the interactions thereof 

Molecular Weieht. Isoelectric Point Markers 

The polypeptides of the present invention can be subjected to fragmentation into smaller 
peptides by chemical and enzymatic means, and the peptide fragments so produced can be used 
in the analysis of other proteins or polypeptides. For example, such peptide fragments can be 
used as peptide molecular weight markers, peptide isoelectric point markers, or in the analysis of 
the degree of peptide fragmentation. Thus, the invention also includes these polypeptides and 
peptide fragments, as well as kits to aid in the determination of the apparent molecular weight 
and isoelectric point of an unknown protein and kits to assess the degree of fragmentation of an 
unknown protein. 

Although all methods of fragmentation are encompassed by the invention, chemical 
fragmentation is a preferred embodiment, and includes the use of cyanogen bromide to cleave 
under neutral or acidic conditions such that specific cleavage occurs at methionine residues (E. 
Gross, Methods in Enz. 1 1 :238-255, 1967). This can further include additional steps, such as a 
carboxymethylation step to convert cysteine residues to an unreactive species. 

Enzymatic fragmentation is another preferred embodiment, and includes the use of a 
protease such as Asparaginylendo-peptidase, Arginylendo-peptidase, Achromobacter protease I, 
Trypsin, Staphlococcus aureus V8 protease, Endoproteinase Asp-N, or Endoproteinase Lys-C 
under conventional conditions to result in cleavage at specific amino acid residues. 
Asparaginylendo-peptidase can cleave specifically on the carboxyl side of the asparagine 
residues present within the polypeptides of the invention. Arginylendo-peptidase can cleave 
specifically on the carboxyl side of the arginine residues present within these polypeptides. 
Achromobacter protease I can cleave specifically on the carboxyl side of the lysine residues 
present within the polypeptides (Sakiyama and Nakat, U.S. Patent No. 5,248,599; T. Masaki et 
al., Biochim. Biophys. Acta 660:44-50, 1981; T. Masaki et aL Biochim. Biophys. Acta 660:51- 
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55, 1981). Trypsin can cleave specifically on the carboxyl side of the areinine and lysine 
residues present within polypeptides of the invention. Enzymatic fragmentation may also occur 
with a protease that cleaves at multiple amino acid residues. For example, Staphlococcus aureus 
V8 protease can cleave specifically on the carboxyl side of the aspartic and glutamic acid 
residues present within polypeptides (D. W. Cleveland, 7. Biol. Chem. J: 1 102-1 106, 1977). 
Endoproteinase Asp-N can cleave specifically on the amino side of the asparagine residues 
present within polypeptides. Endoproteinase Lys-C can cleave specifically on the carboxyl side 
of the lysine residues present within polypeptides of the invention. Other enzymatic and 
chemical treatments can likewise be used to specifically fragment these polypeptides into a 
unique set of specific peptides. 

Of course, the peptides and fragments of the polypeptides of the invention can also be 
produced by conventional recombinant processes and synthetic processes well known in the art. 
With regard to recombinant processes, the polypeptides and peptide fragments encompassed by 
invention can have variable molecular weights, depending upon the host cell in which they are 
expressed. Glycosylation of polypeptides and peptide fragments of the invention in various cell 
types can result in variations of the molecular weight of these pieces, depending upon the extent 
of modification. The size of these pieces can be most heterogeneous with fragments of 
polypeptide derived from the extracellular portion of the polypeptide. Consistent polypeptides 
and peptide fragments can be obtained by using polypeptides derived entirely from the 
transmembrane and cytoplasmic regions, pretreating with N-glycanase to remove glycosylation, 
or expressing the polypeptides in bacterial hosts. 

The molecular weight of these polypeptides can also be varied by fusing additional 
peptide sequences to both the amino and carboxyl terminal ends of polypeptides of the 
invention. Fusions of additional peptide sequences at the amino and carboxyl terminal ends of 
polypeptides of the invention can be used to enhance expression of these polypeptides or aid in 
the purification of the protein. In addition, fusions of additional peptide sequences at the amino 
and carboxyl terminal ends of polypeptides of the invention will alter some, but usually not all, 
of the fragmented peptides of the polypeptides generated by enzymatic or chemical treatment. 
Of course, mutations can be introduced into polypeptides of the invention using routine and 
known techniques of molecular biology. For example, a mutation can be designed so as to 
eliminate a site of proteolytic cleavage by a specific enzyme or a site of cleavage by a specific 
chemically induced fragmentation procedure. The elimination of the site will alter the peptide 
fingerprint of polypeptides of the invention upon fragmentation with the specific enzyme or 
chemical procedure. 

- 51 - 



WO 00/28033 




PCT/US99/26788 



The polypeptides and the resultant fragmented peptides can be analyzed by methods 
including sedimentation, electrophoresis, chromatography, and mass spectrometry to determine 
their molecular weights. Because the unique amino acid sequence of each piece specifies a 
molecular weight, these pieces can thereafter serve as molecular weight markers using such 
analysis techniques to assist in the determination of the molecular weight of an unknown 
protein, polypeptides or fragments thereof. The molecular weight markers of the invention serve 
particularly well as molecular weight markers for the estimation of the apparent molecular 
weight of proteins that have similar apparent molecular weights and, consequently, allow 
increased accuracy in the determination of apparent molecular weight of proteins. 

When the invention relates to the use of fragmented peptide molecular weight markers, 
those markers are preferably at least 10 amino acids in size. More preferably, these fragmented 
peptide molecular weight markers are between 10 and 100 amino acids in size. Even more 
preferable are fragmented peptide molecular weight markers between 10 and 50 amino acids in 
size and especially between 10 and 35 amino acids in size. Most preferable are fragmented 
peptide molecular weight markers between 10 and 20 amino acids in size. 

Among the methods for determining molecular weight are sedimentation, gel 
electrophoresis, chromatography, and mass spectrometry. A particularly preferred embodiment 
is denaturing polyacrylamide gel electrophoresis (U. K. Laemmli, Nature 227:680-685, 1970). 
Conventionally, the method uses two separate lanes of a gel containing sodium dodecyl sulfate 
and a concentration of acrylarnide between 6-20%. The ability to simultaneously resolve the 
marker and the sample under identical conditions allows for increased accuracy. It is 
understood, of course, that many different techniques can be used for the determination of the 
molecular weight of an unknown protein using polypeptides of the invention, and that this 
embodiment in no way limits the scope of the invention. 

Each unglycosylated polypeptide or fragment thereof has a pi that is intrinsically 
determined by its unique amino acid sequence (which pi can be estimated by the skilled artisan 
using any of the computer programs designed to predict pi values currently available, calculated 
using any well-known amino acid pKa table, or measured empirically). Therefore these 
polypeptides and fragments thereof can serve as specific markers to assist in the determination 
of the isoelectric point of an unknown protein, polypeptide, or fragmented peptide using 
techniques such as isoelectric focusing. These polypeptide or fragmented peptide markers serve 
particularly well for the estimation of apparent isoelectric points of unknown proteins that have 
apparent isoelectric points close to that of the polypeptide or fragmented peptide markers of the 
invention. 
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The technique of isoelectric focusing can be further combined with other techniques such 
as gel electrophoresis to simultaneously separate a protein on the basis of molecular weight and 
charge. The ability to simultaneously resolve these polypeptide or fragmented peptide markers 
and the unknown protein under identical conditions allows for increased accuracy in the 
determination of the apparent isoelectric point of the unknown protein. This is of particular 
interest in techniques, such as two dimensional electrophoresis (T.D. Brock and M.T. Madigan, 
Biology of Microorganisms 76-77 (Prentice Hall, 6d ed. 1991)), where the nature of the 
procedure dictates that any markers should be resolved simultaneously with the unknown 
protein. In addition, with such methods, these polypeptides and fragmented peptides thereof can 
assist in the determination of both the isoelectric point and molecular weight of an unknown 
protein or fragmented peptide. 

Polypeptides and fragmented peptides can be visualized using two different methods that 
allow a discrimination between the unknown protein and the molecular weight markers. In one 
embodiment, the polypeptide and fragmented peptide molecular weight markers of the invention 
can be visualized using antibodies generated against these markers and conventional 
immunoblotting techniques. This detection is performed under conventional conditions that do 
not result in the detection of the unknown protein. It is understood that it may not be possible to 
generate antibodies against all polypeptide fragments of the invention, since small peptides may 
not contain immunogenic epitopes. It is further understood that not all antibodies will work in 
this assay; however, those antibodies which are able to bind polypeptides and fragments of the 
invention can be readily determined using conventional techniques. 

The unknown protein is also visualized by using a conventional staining procedure. The 
molar excess of unknown protein to polypeptide or fragmented peptide molecular weight 
markers of the invention is such that the conventional staining procedure predominantly detects 
the unknown protein. The level of these polypeptide or fragmented peptide molecular weight 
markers is such as to allow little or no detection of these markers by the conventional staining 
method. The preferred molar excess of unknown protein to polypeptide molecular weight 
markers of the invention is between 2 and 100,000 fold. More preferably, the preferred molar 
excess of unknown protein to these polypeptide molecular weight markers is between 10 and 
10,000 fold and especially between 100 and 1,000 fold. 

It is understood of course that many techniques can be used for the determination and 
detection of molecular weight and isoelectric point of an unknown protein, polypeptides, and 
fragmented peptides thereof using these polypeptide molecular weight markers and peptide 
fragments thereof and that these embodiments in no way limit the scope of the invention. 
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In another embodiment, the analysis of the progressive fragmentation of the polypeptides 
of the invention into specific peptides (D. W. Cleveland et al.. J. Biol. Chem. 252:1 102-1 106, 
1 977), such as by altering the time or temperature of the fragmentation reaction, can be used as a 
control for the extent of cleavage of an unknown protein. For example, cleavage of the same 
amount of polypeptide and unknown protein under identical conditions can allow for a direct 
comparison of the extent of fragmentation. Conditions that result in the complete fragmentation 
of the polypeptide can also result in complete fragmentation of the unknown protein. 

As to the specific use of the polypeptides and fragmented peptides of the invention as 
molecular weight markers, the fragmentation of the polypeptides of SEQ ID NOs:27-38 with 
cyanogen bromide generates a unique set of fragmented peptide molecular weight markers. An 
additional fragment results if the initiating methionine is present. The distribution of methionine 
residues determines the number of amino acids in each peptide and the unique amino acid 
composition of each peptide determines its molecular weight. 

In addition, the preferred purified polypeptides of the invention (SEQ ID NOs:27-38) 
have calculated molecular weights of approximately 3683, 1783, 1 1248, 75503, 43040, 8051, 
33306, 3515, 10736, 25162, and 2450 Daltons. 

Where an intact protein is used, the use of these polypeptide molecular weight markers 
allows increased accuracy in the determination of apparent molecular weight of proteins that 
have apparent molecular weights close to 3683, 1783, 1 1248, 75503, 43040, 8051, 33306, 3515, 
10736, 25 162, or 2450 Daltons. Where fragments are used, there is increased accuracy in 
determining molecular weight over the range of the molecular weights of the fragment. 

Finally, as to the kits that are encompassed by the invention, the constituents of such kits 
can be varied, but typically contain the polypeptide and fragmented peptide molecular weight 
markers. Also, such kits can contain the polypeptides wherein a site necessary for fragmentation 
has been removed. Furthermore, the kits can contain reagents for the specific cleavage of the 
polypeptide and the unknown protein by chemical or enzymatic cleavage. Kits can further 
contain antibodies directed against polypeptides or fragments thereof of the invention. 

Identification of Unknown Proteins 

As set forth above, a polypeptide or peptide fingerprint can be entered into or compared 
to a database of known proteins to assist in the identification of the unknown protein using mass 
spectrometry (W.J. Henzel et al., Proc. Natl. Acad. Sci. USA 90:501 1-5015, 1993; D. Fenyo et 
al., Electrophoresis 19:998-1005, 1998). A variety of computer software programs to facilitate 
these comparisons are accessible via the Internet, such as Protein Prospector (Internet site: 
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prospector.uscf.edu). Multildem (Internet site: xvww.expasy.ch/sproumuUiident.html). 
PeptideSearch (Internet site: www.mann.embl-heiedelberg.de.. .dcSearch/FR_PeptideSearch 
Form.html). and ProFound (Internet site: www.chait-sgi.rockefeller.edu/cgi-bin/prot-id- 
frag.html). These programs allow the user to specify the cleavage agent and the molecular 
weights of the fragmented peptides within a designated tolerance. The programs compare these 
molecular weights to protein databases to assist in determining the identity of the unknown 
protein. 

In addition, a polypeptide or peptide digest can be sequenced using tandem mass 
spectrometry (MS/MS) and the resulting sequence searched against databases (J.K. Eng, et al., J. 
Am. Soc. Mass Spec. 5:976-989 (1994); M. Mann and M. Wilm,,W. Chem. 66:4390-4399 
( 1 994); J. A. Taylor and R.S. Johnson, Rapid Comm. Mass Spec. 11:1 067- 1 075 ( 1 997)). 
Searching programs that can be used in this process exist on the Internet, such as Lutefisk 97 
(Internet site: www.lsbc.com:70/Lutefisk97.html), and the Protein Prospector, Peptide Search 
and ProFound programs described above. Therefore, adding the sequence of a gene and its 
predicted protein sequence arid peptide fragments to a sequence database can aid in the 
identification of unknown proteins using tandem mass spectrometry. 

Antibodies 

Antibodies that are immunoreactive with the polypeptides of the invention are provided 
herein. Such antibodies specifically bind to the polypeptides via the antigen-binding sites of the 
antibody (as opposed to non-specific binding). Thus, the polypeptides, fragments, variants, 
fusion proteins, etc.. as set forth above may be employed as immunogens in producing 
antibodies immunoreactive therewith. 

Polyclonal and monoclonal antibodies may be prepared by conventional techniques. 
See, for example, Monoclonal Antibodies, Hybridomas: A New Dimension in Biological 
Analyses, Kennet et al. (eds.), Plenum Press, New York (1980); and Antibodies: A Laboratory 
Manual , Harlow and Land (eds.). Cold Spring Harbor Laboratory Press, Cold Spring Harbor. 
NY, (1988). 

Antigen-binding fragments of such antibodies, which may be produced by conventional 
techniques, are also encompassed by the present invention. Examples of such fragments 
include, but are not limited to, Fab and F(ab') 2 fragments. Antibody fragments and derivatives 
produced by genetic engineering techniques are also provided. 

The monoclonal antibodies of the present invention include chimeric antibodies, e.g., 
humanized versions of murine monoclonal antibodies. Such humanized antibodies may be 
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prepared by known techniques, and offer the advantage of reduced immunogenicitv when the 



comprises the variable region of a murine antibody (or just the antigen binding site thereof) and 
a constant region derived from a human antibody. Alternatively, a humanized antibody 
fragment may comprise the antigen binding site of a murine monoclonal antibody and a variable 
region fragment (lacking the antigen-binding site) derived from a human antibody. Procedures 
for the production of chimeric and further engineered monoclonal antibodies include those 
described in Riechmann et aL (Nature 352:323, 1988), Liu et al. (PNAS 54:3439, 1987), Larrick 
et ah (Bio/Technology* 7:934, 1989), and Winter and Harris (TIPS 74:139, May, 1993). 
Procedures to generate antibodies transgenically can be found in GB 2,272,440, US Patent Nos. 
5,569,825 and 5,545,806 and related patents claiming priority therefrom, all of which are 
incorporated by reference herein. 

In one embodiment, the antibodies are specific for the polypeptides of the present 
invention, and do not cross-react with other proteins. Screening procedures by which such 
antibodies may be identified are well known, and may involve immunoaffmity chromatography, 
for example. 

Hybridoma cell lines that produce monoclonal antibodies specific for the polypeptides of 
the invention are also contemplated herein. Such hybridomas may be produced and identified 
by conventional techniques. One method for producing such a hybridoma cell line comprises 
immunizing an animal with a polypeptide; harvesting spleen cells from the immunized animal; 
fusing said spleen cells to a myeloma cell line, thereby generating hybridoma cells; and 
identifying a hybridoma cell line that produces a monoclonal antibody that binds the 
polypeptide. The monoclonal antibodies may be recovered by conventional techniques. 

Uses Thereof 

The antibodies of the invention can be used in assays to detect the presence of the 
polypeptides or fragments of the invention, either in vitro or in vivo. The antibodies also may be 
employed in purifying polypeptides or fragments of the invention by immunoaffmity 
chromatography. 

Those antibodies that additionally can block binding of the polypeptides of the invention 
to the binding partner may be used to inhibit a biological activity that results from such binding 
Such blocking antibodies may be identified using any suitable assay procedure, such as by 
testing antibodies for the ability to inhibit binding of the binding partner to certain cells 
expressing the binding partner. Alternatively, blocking antibodies mav be identified in assavs 
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for the ability to inhibit a biological effect that results from binding of the binding panner to 
target cells. 

Such an antibody may be employed in an in vitro procedure, or administered in vivo to 
inhibit a biological activity mediated by the entity that generated the antibody. Disorders caused 
or exacerbated (directly or indirectly) by the interaction of (binding panner) with cell surface 
(binding panner) receptor thus may be treated. A therapeutic method involves in vivo 
administration of a blocking antibody to a mammal in an amount effective in inhibiting a 
(binding partner)-mediated biological activity. Monoclonal antibodies are generally preferred for 
use in such therapeutic methods. In one embodiment, an antigen-binding antibody fragment is 
employed. 

Antibodies may be screened for agonistic (i.e., ligand-mimicking) properties. Such 
antibodies, upon binding to cell surface antigen, induce biological effects (e.g., transduction of 
biological signals) similar to the biological effects induced when (binding partner) binds to cell 
surface antigen. 

Compositions comprising an antibody that is directed against a polypeptide of the 
invention, and a physiologically acceptable diluent, excipient, or carrier, are provided herein. 

Also provided herein are conjugates comprising a detectable (e.g., diagnostic) or 
therapeutic agent, attached to the antibody. Examples of such agents are presented above. The 
conjugates find use in in vitro or in vivo procedures. 

The following examples are provided to further illustrate particular embodiments of the 
invention, and are not to be construed as limiting the scope of the present invention. 

EXAMPLE 1: Isolation of the IMX Nucleic Acids 
The T84 Epithelial Barrier Model 

As discussed above, damage to the intestinal epithelial barrier is a hallmark of (IBD), 
and a number of in vitro models of epithelial barrier function have been developed over the 
years. The best characterized of these models is the T84 intestinal epithelial barrier system, 
Dharmsathaphorn et al.. Am. J. Physiol., 246:G204-G208, 1984 and Madara et al., J. Cell Biol., 
101:2124-2133.1985). 

T84 cells were plated on 75 mm polycarbonate transwell filter inserts (Costar) and grown 
in DME/F12 (1:1) containing 10% heat-inactivated bovine calf serum. The cells were 
maintained at confluence for 2-3 days, and integrity of the epithelial barrier was determined by 
measuring transepithelial electrical resistance (TER) using an EVOM epithelial voltohmmeter 
(World Precision Instruments). When the TER values were greater than 1000 ohms/cm2 and 
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were stable, cells were treated with interferon-g (30 ng/ml. Genzyme) added to the basolateral 
side of the membrane. At various times after treatment (4, 24 and 44h), TERs were measured to 
monitor the interferon-induced disruption of the barrier, and RNA was harvested from the cells 
at those time points using TRIzol reagent (Life Technologies). RNA was extracted using 
conventional methods and subsequently used for TOGA™ analysis as described in Example 2. 
TOGA™ analysis 

This example describes a method for determining mRNA expression characteristics. The 
isolated RNA was analyzed using a method of simultaneous sequence-specific identification of 
mRNAs known as TOGA™ (Total Gene expression Analysis) described in U.S. Patent No. 
5,459,037 and U.S. Patent No. 5,807,680, hereby incorporated herein by reference. Preferably, 
prior to the application of the TOGA™ technique, the isolated RNA was enriched to form a 
starting polyA-containing mRNA population by methods known in the art. In a preferred 
embodiment, the TOGA™ method further comprised an additional PCR step performed using 
four separate reactions, one for each of the four 5' PCR primers, and cDNA templates prepared 
from a population of antisense cRNAs. A final PCR step used 256 5' PCR primers in 64 
subpools for each of the four reactions of the previous step produced PCR products that were 
cDNA fragments that corresponded to the 3'-region of the starting mRNA population. 

The produced PCR products were then identified by a) the sequence of at least the 5' 
seven base pairs, preferably the sequence of the entire fragment, and b) the length of the 
fragment. These two parameters, sequence and fragment length, were used to compare the 
obtained PCR products to a database of known polynucleotide sequences. A database search for 
homologous sequences in Genbank resulted in no matches, indicating the novelty of the IMX 
sequences of the invention. The intensities of the PCR products were compared across the 4 
input RNA samples (t=0, 4hr, 24hr, 44hr) and species that were regulated were identified and 
further characterized (Table I ). 

The DSTs of SEQ ID NO: 1 -26 and fragments thereof are useful as probes to study and 
diagnose the changes in gene expression demonstrated by the data of Table 1 . Alternatively, 
polypeptides and fragments thereof that are the translations of SEQ ID NO: 1-26 and fragments 
thereof are useful as probes to study and diagnose the changes in gene expression demonstrated 
bv the data of Table 1. 
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Table 1 . IMX Nucleic Acid Molecules. Relative expression levels determined by TOGA™. 



10 



15 



.20 



EXAM PLE 2: Further Characterization of TMX4 
Nucleic Acid Molecules and Polypeptides 
An additional 600bp sequence (SEQ ID NO: 1 1 ) that included DST IMX4 (SEQ ID 
NO: 1 ) was obtained by anchor PGR from a T84 library. A clone approximately 2.0kb in length 
was then isolated from a commercial library (Origene). The sequence of the 3' end of the clone 
(SEQ ID NO:12) included the sequence of the IMX4 DST. The sequence of the 5* end of the 
clone (SEQ ID NO:I3) matched pan of the human ApoL gene (AF019225. Figure 21). 
Complete sequence of the clone was not obtained. However, comparison of 5' end of the clone 
suggests it represents an alternative splice product to the reported ApoL sequence, i.e., bases 1- 
168 match to 2 exons on PAC carrying the ApoL gene but are not included in the reported 
complete cDNA. See Figure 22. A polypeptide translated from IMX4 is provided in SEQ ID 
NO: 27. 

EXAMPLE 3: Fu rther Characterization of IMX! 0 and IMX21 
Nucleic Acid Molecules and Polypeptide 
Further experiments failed to show regulation of these DSTs in replications using PCR 
with extended primers and Northern blots. A polypeptide translated from LMX10 (SEQ ID 
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NO: 14) is provided in SEQ ID NO: 28. A polypeptide translated from IMX21 (SEQ ID NO: 15) 
is provided in SEQ ID NO: 29. 



EXAMPLE 4: Further Characterization of IMX28 
Nucleic Acid Molecules and Polypeptides 

One polypeptide translated from IMX28 is provided in SEQ ID NO: 30. 

A GST fusion protein of IMX28 was used in studies designed to identify potential 
ligands for this enzyme.The GST fusion of IMX28 was prepared as follows: IMX28-8, a clone 
with the full length coding sequence, was excised from pGEM by Sail-Not I digestion and 
subcloned into Sall-NotI digested pGEX-5X-3 (Pharmacia). The pGEX-5X-3 plasmid contains 
the Schistosoma japonicum glutathione-S-transferase (GST) gene 5' to the Sall-NotI cloning 
site. Insertion of the IMX28-8 DNA fragment into this cloning site results in a ftision gene 
containing GST-IMX28 in the proper reading frame. The complete sequence of the fusion 
protein consists of GST followed by a Factor Xa tripeptide cleavage site, the peptide consisting 
of Gly-Ile-Pro-Arg-Asn-Ser-Arg-Val-Asp-AIa-Thr that is derived partially from the pGEX 
vector and partially from the Sall-NotI fragment of IMX28 and IMX28. The vector containing 
the fusion gene was electroporated into competent bacterial cells and a single colony containing 
the DNA, confirmed by diagnostic restriction digestion, was expanded to a 250ml culture for 
purification of the fusion protein. 

Briefly, cells from these cultures were pelleted and resupended in STE buffer containing 
a protease inhibitor cocktail and 20ug/ml hen egg lysozyme. The cells were lysed by addition of 
10% sarcosyl in STE (final sarcosyl concentration was 1.5%), sonicated and centrifiiged to 
remove cell debris. The supernatant was passed over a Glutathione-Sepharose 4B column, 
washed sequentially with 1.5% sarcosyl in STE and then STE. The bound material, containing 
the GST-IMX28 fusion protein was eluted by adding 20mM Glutathione in STE. The eluted 
material was analyzed by SDS-PAGE to confirm the presence of the GST-IMX28 fusion 
protein. 

A recombinant adenovirus construct expressing TMX28 was used to infect T84 cells to 
determine effects on epithelial barrier formation. To construct adenoviral vectors with untagged 
full length Imx28 and NH2-terminally FLAG tagged Imx28, the pADEASYl system described 
by He et al. (1998 ) Proc. Natl. Acad. Sci". USA 95: 2509-2514, was used. For the untagged 
construct a Notl fragment carrying the full coding region of Imx28 was inserted into the 
polylinker of the pAdtrackCMV shuttle vector. After confirmation of the correct orientation the 
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vector was cut with the restriction enzyme Pacl and co-transformed into E. coli . Strain BJ5183 
along with the pADEasyl vector. Recombinants containing the Imx28 product were screened for 
as described and by gene specific PCR. To generate non-replicative. infectious viral particles the 
helper strain 293EBNA-EBNA or 293MSR was transfected with Prnel linearized recombinant 
vector. After 3 days virus was harvested by freeze-thaw lysis of the transfected cells. Several 
rounds of infection of these cells with viral supematents were performed to boost titers. The 
tagged version was prepared by addition of the FLAG sequence and a Gly-Gly-Gly-Gly spacer 
region onto the full length coding sequence of Imx28 via PCR with a 5' oligo with the 
incorporated nucleotides for the tag and spacer. The PCR product was cloned into a TA- vector 
and excised with NotI and cloned as above. Screening and viral particle production was as 
above. No deleterious or positive effects were observed on T84 barrier function in the presence 
or absence of IFN-gamma. 



EXAMP LE 5: Further Characterization of IMX32 
Nucleic Acid Molecules and Polypeptides 

The original clone produced by extending the IMX32 DST was 1588 bp in length (SEQ 
ID NO: 1 7). The first round of anchor PCR yielded a 670bp product that was used to produce a 
contig that extended the sequence 585 bases to 2173 (SEQ ID NO: 17). Another round of anchor 
PCR yielded the 1676 bp product, immunex32-a24.seq. (SEQ ID NO: 18) which further 
extended the contig for Imx32 to 2834 bp. In addition, the immunex32-a24.seq product revealed 
an alternative splice lacking bases 21 1-639 of IMX32-1 clone. 

There is an ORF for 155 amino acids at the 5' end of the sequence (SEQ ID NO:31 
translated from SEQ ID NO: 16; SEQ ID NO:32 translated from SEQ ID NO: 18). BLAST 
analysis suggests that the ORF may contain a KRAB-like domain because it has partial matches 
to Zn-finger proteins which contain KRAB domains at their N-terminus such as ZNF140. 

EXAMPLE 6: Further Characterization of IMX39 
Nucleic Acid Molecules and Polypeptides 

One polypeptide translated from IMX39 is provided in SEQ ID NO: 33. 

IMX39 FLAG-tagged and untagged adenoviral vector versions were prepared as outlined 
above for Imx28 in Example 4, above. Controls for the expression were infection with other 
FLAG- tagged adenoviral delivered proteins such as IMX5. Based upon the predicted structure 
from its cDNA sequence, it was expected that the IMX39 polypeptide would be a cytoplasmic 
protein. However, since some secreted cytokines, such as those of the IL-1 family, lack a 
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predicted signal sequence but are secreted from the cell, this expectation was tested 
experimentally in both T84 cells via adenoviral mediated transduction and via transfection into 
CV1EBNA cells with another expression vector, pDC412. Despite evidence for succesful 
expression in the transfected cells, no product was found in the media as determined by FLAG 
western (T84 system) or 35-S radiolabeled product (CV1EBNA system). The positive control in 
the CV1 system was an Imx44-Fc fusion protein known to be secrete. In both systems empty 
vectors were used as a negative control. The attempted expression of IMX39 polypeptides had 
no apparent effect on barrier function in the absence or presence of IFN-gamma. 



EXAMPLE 7: Further Characterization of I1V1X40 
Nucleic Acid Molecules and Polypeptides 

Anchor PCR using an Origene human small intestine library revealed a product of 1669 
bp (SEQ ID NO: 20). Bases 1-265 of the DST IMX40 (SEQ ID NO: 7) align with bases 1565- 
1669 of this sequence. The anchor PCR product has a potential 155 aa ORF from bases 185-650. 
A translation of this ORF is given in SEQ ID NO:34. This ORF has no recognizable features, 
motifs, or database homologies. 



EXAMP LE 8: Further Characterization of IMX42 
Nucleic Acid Molecules and Polypeptides 

Several attempts to screen libraries using DST IMX42 (SEQ ID NO:8) as well as anchor 
PCR and RACE have not produced a clone with a longer sequence. BLAST comparison against 
against imxhutdb extended sequence approx 200 bases to 592 bases (SEQ ID NO: 21). 
However, the sequence extension did not doesn't increase any database matches found by 
comparison to the DST IMX42 sequence. The contig has a potential ORF of 134 amino acids 
(SEQ ID NO: 36) which, while a COOH terminal extension of SEQ ID NO:35, may be a partial 
sequence. 
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EXAMPLE 9: Further Characterization of IMX44 
Nucleic Acid Molecules and Polypeptides 

Longer polynucleotides corresponding to DST IMX44 have been identified (SEQ ID 
NO:22, 23, 24). A translated polypeptide is found in SEQ ED NO:37. A soluble Fc form of 
IMX44 polypeptide was synthesized and used in various assays. DvlX44-Fc had no effect on 
T84 barrier function in the absence or presence of IFN-gamma. IMX44-Fc had no effect on 
natural killer (NK) cell activation. IMX44-Fc had no positive hits on cognate screen assays. 

A soluble FLAG polyHis form was also produced. No activity in cellular activation 
assays was found nor any alteration of cytokine production using this polypeptide in assays. 

The expression of IMX44 in various murine models of gut inflammation was determined 
by Northern and array analysis. Little to no regulation of transcript was found was found in 
anti-CD3-induced ileitis in C57BL/6 mice, DSS-induced colitis in BALB/c mice or C57BL/6 
mice, mdr\ knock out mice with colitis, and IFN-gamma stimulated LN T cells. 

EXAMPLE 10: Further Characterization of IMX56 
Nucleic Acid Molecules and Polypeptides 

A translated IMX56 polypeptide is found in SEQ ID NO:38. Comparison of the DST 
sequence IMX56 to IMAGE consortium clones extended the sequence (SEQ ID NO:25) which 
was y on sequenced PAC to the described end of human ApoL. Anchor PCR using a T84 
library produced results that indicated that the IMX56 DST is derived from an alternate 3* UTR 
of ApoL. 



EXAM PLE 1 1 : Monoclonal Antibodies That Bind IMX Polypeptides 

Monoclonal antibodies that bind the polypeptides of the invention can be prepared by 
methods well known in the art. Suitable immunogens that may be employed in generating such 
antibodies include, but are not limited to, purified IMX polypeptides or an immunogenic 
fragment thereof such as the extracellular domain, or fusion proteins containing IMX 
polypeptides (e.g., a soluble IMX 21poIypeptide/Fc fusion protein). 

Purified IMX polypeptides of the invention can be used to generate monoclonal 
antibodies immunoreactive therewith, using conventional techniques such as those described in 
U.S. Patent 4,41 1,993. Briefly, mice are immunized with an IMX polypeptide immunogen 
emulsified in complete Freund's adjuvant, and injected in amounts ranging from 10-100 :g 
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subcutaneously or intraperitoneally. Ten to twelve days later, the immunized animals are 
boosted with additional IMX polypeptide immunogen emulsified in incomplete Freund's 
adjuvant. Mice are periodically boosted thereafter on a weekly to bi-weekly immunization 
schedule. Serum samples are periodically taken by retro-orbital bleeding or tail-tip excision to 
test for anti-IMX polypeptide antibodies by dot blot assay, ELISA (Enzyme-Linked 
Immunosorbent Assay) or inhibition of IMX polypeptide/binding partner interactions. 

Following detection of an appropriate antibody titer, positive animals are provided one 
last intravenous injection of IMX polypeptide immunogen in saline. Three to four days later, the 
animals are sacrificed, spleen cells harvested, and spleen cells are fused to a murine myeloma 
cell line, e.g., NS1 or preferably P3x63Ag8.653 (ATCC CRL 1580). Fusions generate 
hybridoma cells, which are plated in multiple microtiter plates in a HAT (hypoxanthine, 
aminopterin and thymidine) selective medium to inhibit proliferation of non-fused cells, 
myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells are screened by ELISA for reactivity against the purified IMX 
polypeptide of interest by adaptations of the techniques disclosed in Engvall et al., 
1mm tin och em . 8:871, 1971 and in U.S. Patent 4,703,004. A preferred screening technique is the 
antibody capture technique described in Beckmann et al., (J. Immunol. 144:4212, 1990) 
Positive hybridoma cells can be injected intraperitoneally into syngeneic BALB/c mice to 
produce ascites containing high concentrations of anti-IMX polypeptide monoclonal antibodies. 
Alternatively, hybridoma cells can be grown in vitro in flasks or roller bottles by various 
techniques. Monoclonal antibodies produced in mouse ascites can be purified by ammonium 
sulfate precipitation, followed by gel exclusion chromatography. Alternatively, affinity 
chromatography based upon binding of antibody to Protein A or Protein G can also be used, as 
can affinity chromatography based upon binding to the IMX polypeptide of interest. 

Alternatively, IMX nucleic acid molecules or fragments thereof can be expressed to 
produce IMX polypeptides or fragments thereof, that can be used to make antibodies that are 
useful for identifying corresponding polypeptides in techniques such as western blotting, 
immunocytochemistry ,and ELISA assays using standard techniques such as those described in 
U.S. Patent No. 4,900,81 1, incorporated by reference herein. 

The references cited herein are incorporated by reference herein in their entirety. 
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We claim: 

1 . A purified polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence of SEQ ID NO:27; 

b) a polypeptide comprising an amino acid sequence of SEQ ID NO:28; 

c) a polypeptide comprising an amino acid sequence of SEQ ID NO:29; 

d) a polypeptide comprising an amino acid sequence of SEQ ID NO:30; 

e) a polypeptide comprising an amino acid sequence of SEQ ID NO:31; 
0 a polypeptide comprising an amino acid sequence of SEQ ID NO:32; 

g) a polypeptide comprising an amino acid sequence of SEQ ID NO:33; 

h) a polypeptide comprising an amino acid sequence of SEQ ID NO:34; 

i) a polypeptide comprising an amino acid sequence of SEQ ID NO:35; 
j) a polypeptide comprising an amino acid sequence of SEQ ID NO:36; 
k) a polypeptide comprising an amino acid sequence of SEQ ID NO:37; 

1) a polypeptide comprising an amino acid sequence of SEQ ID NO:38; and 

m) a fragment of the polypeptide of (a), (b), (c), (d), (e), (f), (g), (h), (i), (j), (k), or (1) 

wherein the expression of the mRNA encoding such polypeptide is altered in a T84 model of gut 

barrier function. 



2. A purified polypeptide comprising an amino acid sequence that is at least 80% identical 
to an amino acid sequence of claim 1. 



3. An isolated DNA molecule encoding a purified polypeptide comprising an amino acid 
sequence that is at least 80% identical to an amino acid sequence of claim 1. 



4. An isolated DNA molecule, wherein said isolated DNA molecule comprises a nucleotide 
sequence selected from the group consisting of: 

a) SEQ ID NO:l; 

b) SEQ ID NO:2; 

c) SEQ ID NO:3; 
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d) SEQ ID NO:4; 

e) SEQ ID NO: 5; 
0 SEQ ID NO:6; 

g) SEQ ID NO:7; 

h) SEQ ID NO:8; 

i) SEQ ID NO:9; and 

j) SEQ ID NO: 1 0 wherein the isolated DNA molecule has altered expression in a T84 
model of gut barrier function. 



5. An isolated DNA molecule, wherein said DNA molecule encodes a polypeptide 
comprising an amino acid sequence that is at least 80% identical to an amino acid sequence 
encoded by the DNA of claim 4. 



6. An expression vector comprising the isolate DNA molecule according to claim 4. 



7. A host cell transformed with the expression vector according to claim 5. 



8. An isolated DNA molecule, wherein said isolated DNA molecule comprises a nucleotide 
sequence selected from the group consisting of: SEQ ID NO:l - 26. 

9. An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95% identical to a polynucleotide having a sequence selected from 
the group consisting of: 

(a) a polynucleotide fragment of SEQ ID NO: 1 -26 or a polynucleotide which is 
hybridizable to SEQ ID NO: 1-26; 

(b) a polynucleotide encoding a polypeptide fragment of a translation of SEQ ID NO: 1- 
26 or a polypeptide fragment encoded by the cDNA sequence which is hybridizable to SEQ ID 
NO: 1 -26; 

(c) a polynucleotide encoding a polypeptide epitope of a translation of SEQ ID NO: 1- 
26 or a polypeptide epitope encoded by a cDNA sequence which is hybridizable to SEQ ID 
NO: 1-26; 



- 66 - 





WO 00/28033 PCT/US99/26788 

(e) a polynucleotide encoding a polypeptide of a translation of SEQ ID NO: 1-26, 
having biological activity; 

(0 a polynucleotide which is a variant of SEQ ID NO: 1 -26; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO: 1 -26; 

(h) a polynucleotide which encodes a species homologue of a translation of SEQ ID 
NO: 1-26; 

(i) a polynucleotide capable of hybridizing under stringent conditions to anv 

one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not hybridize 
under stringent conditions to a nucleic acid molecule having a nucleotide sequence of only A 
residues or of only T residues. 

10. The isolated nucleic acid molecule of claim 9, wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding a secreted protein. 

1 1 . The isolated nucleic acid molecule of claim 9, wherein the 

polynucleotide fragment comprises a nucleotide sequence encoding a polypeptide chosen from 
the group consisting of: 

(a) a polypeptide having the polypeptide sequence identified as a translation of 
SEQ ID NO: 1-26; 

(b) a polypeptide having the polypeptide sequence of SEQ ID NO: 27-38; and 

(c) a polypeptide encoded by the cDNA which is hybridizable to SEQ ID NO: 1-26. 

12. The isolated nucleic acid molecule of claim 9, wherein the polynucleotide fragment 
comprises the entire nucleotide sequence of SEQ ID NO: 1-26 or the cDNA sequence which is 
hybridizable to SEQ ID NO: 1-26. 

13. The isolated nucleic acid molecule of claim 1 1 wherein the nucleotide sequence 
comprises sequential nucleotide deletions from sequence encoding either the C-terminus or the 
N-terminus. 



14. The isolated nucleic acid molecule of claim 10 wherein the nucleotide sequence 
comprises sequential nucleotide deletions from sequence encoding either the C-terminus or the 
N-terminus. 
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15. A recombinant vector comprising the isolated nucleic acid molecule of claim 9. 

16. A method of making a recombinant host cell comprising the isolated nucleic acid 
molecule of claim 9. 



1 7. A recombinant host cell produced by the method of claim 1 6. 

18. An isolated polypeptide comprising an amino acid sequence at least 90% identical to a 
sequence selected from the group consisting of: 

(a) a polypeptide fragment of a polypeptide encoded by a polynucleotide of SEQ ID 
NO: 1-26; 

(b) a polypeptide having the sequence of SEQ ID NO: 27-38: 

(c) a polypeptide domain of a polypeptide encoded by a polynucleotide of SEQ ID NO: 



1-26; 



1-26; 



26; 



and 



1-26. 



(d) a polypeptide epitope of a polypeptide encoded by a polynucleotide of SEQ ID NO: 

(e) a secreted form of a polypeptide encoded by a polynucleotide of SEQ ID NO: 1-26; 
(0 a full length protein of a polypeptide encoded by a polynucleotide of SEQ ID NO: 1- 

(g) a variant of a polypeptide encoded by a polynucleotide of SEQ ID NO: 1-26; 

(h) an allelic variant of a polypeptide encoded by a polynucleotide of SEQ ID NO: 1-26; 

(i) a species homologue of a polypeptide encoded by a polynucleotide of SEQ ID NO: 



19. The isolated polypeptide of claim 1 8, wherein the full length polypeptide comprises 
sequential amino acid deletions from the C-terminus. 

20. The isolated polypeptide of claim 1 8, wherein the mature polypeptide comprises 
sequential amino acid deletions from the C-terminus. 

21. The isolated polypeptide of claim 1 8, wherein the full length polypeptide comprises 
sequential amino acid deletions from the N-terminus. 
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22. The isolated polypeptide of claim 18. wherein the mature polypeptide comprises 
sequential amino acid deletions from the N-terminus. 

23. An isolated antibody that binds specifically to the isolated polypeptide of claim 1 8. 

24. A recombinant host cell that expresses the isolated polypeptide of claim 18. 

25. A method of making an isolated polypeptide comprising: 

(a) culturing the recombinant host cell of claim 24 under conditions such that said 
polypeptide is expressed; and 

(b) recovering said polypeptide. 

26. The polypeptide produced by the method of claim 25. 

27. A method for preventing, treating, or ameliorating a medical condition, comprising 
administering to a mammalian subject a therapeutically effective amount of the polypeptide of 
claim 18 or the polynucleotide of claim 9. 

28. The method of claim 25 wherein the medical condition is irritable bowel disease. 

29. A method of diagnosing irritable bowel disease or a susceptibility to irritable bowel 
disease in a subject comprising: 

(a) determining the presence or absence of a polynucleotide of claim 9; and 

(b) diagnosing irritable bowel disease or a susceptibility to irritable bowel disease based 
on the presence or absence of said polynucleotide. 

30. A method of diagnosing irritable bowel disease or a susceptibility to irritable bowel 
disease in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide of claim 18 in a 
biological sample; and 

(b) diagnosing irritable bowel disease or a susceptibility to irritable bowel disease based 
on the presence or amount of expression of the polypeptide. 



31. A method for identifying a binding partner to the polypeptide of claim 18 



comprising: 
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(a) contacting the polypeptide of claim 18 with a binding partner; and 

(b) determining whether the binding partner affects an activity of the 
polypeptide. 



32. A method of identifying an activity in a biological assay, wherein the method comprises 

(a) expressing the polynucleotide of SEQ ID NO: 1-26 in a cell; 

(b) isolating the supernatant; 

(c) detecting an activity in a biological assay; and 

(d) identifying the polypeptide in the supernatant having the activity. 
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CCGG7AAG7AAJUZAG7CAGAAAA7TAGCA7GAAAGCAG777AGCA77GGGAGGAAGCACA 

GATCTCTAGAGCTGTCCTGTCGCTGCCCAGGATTGACCTG7GTGTAAGTCCCAATAAACT 
CACC7AC7CACCAAAAA 
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GAATTTAATACGACTCACTATAGGGAATTTGGCCCTCGAGGCCAAGAATTCGGCACGAGvj 
CAACAACAACAACAAAAAAAAACTGAACATCTCCATATTACTGACACCCAATTCAAGAAA 
CAAAATATTACAGCCCCTTCCAGGATATTCCTGGGGTCTCTTCCATCTCTACTAACCCC"- 
GACTACAAACAGCCTCCACCTATTTCACCTGACATTGTACTTTATGAAAGCAGCAGT'^'-^ 
CAGATGGGGCTATTTTGCCCCCTGGGGACATTAGGGAGTATCTGGAGACAC^GAGGGTTG 
TGTCTACTTGGGGGGAGTTGTGTTACTGCATCCAGTGAGTCCAGGGATCCAGGGATG-CG 
CTCAACATCCTGAAATGCACAGGGAACCCCCACACATAGAACAGAGAAATTGCTGAGCCA 
AAATGTCAGCAGTGTCACAGCTGACACCCTGATATACACACTATCACACAGTATC^GCTC 
TTTCGGGCTCAGGATCTTTTTCATTCTAATCATCTCATAGGAAACAGAAATGTCA T TTAG 
AGGTAGGTACAGTCCACAACAAAGAAGAACCTGAGTTTTTTTTTTTTTTT^AATCAGCC" 
GGTGCCTTTAGAGCTAGGATTTAGTTTCTATTCTTTCTGTCTCATTTTCAAGTGATT" 1 — ,Z. 
TTCTTCAAATGGCATC7ACTGGGCTCAAGAACTGGAGATCCCCACAAAGC~G=vGA~^^ 
ATGGGAATTTTGTACACACCCACACAGGTATACACTTCCATTTACATGCAGACATCCAC^ 
CACAGATACACACATCCGGAGACCAAGACAGAACGCAAACTGCCCCATAAAAGCACGGT^" 
CCCCAAACAGGAGAAACGCACCATTCACTCCAGGGAGGTATCTATTTG^TTAATT^AGC^ 
TCTGATAGTCAGGCTGTTGCCAAGCCCAGCTCTGAAACTCTTCCCCTCTAGGAAAGAAAG 

ATGGATTTTTTCTTTACTCAAGAATATAGATCTAAAAAAAAAAAAJvAAAAAAGT^GG^GG 
CCGCAAGCTTATTCCCTT7AGTGA 
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CGTACCGGGCCCCCCCTCGAGG7CGACGG7A7CGA7AAGC77GA7A7CGAA77CGCGGCC 

GCTGAGAAAT7AAC7CCCCGGGGCCGCCGGG77GAC7GCGC7GCC7GGGCCGGAGG7C7'" 

C7CCGGCCAGGGAGCGC7G7GGGAAGGGGC7CGAGCGGCCAGGGCCAGGCGAGGCCGGGG 

GGGCGGGGGGTTAGGGGACCGCGGGGCTACTCTTGGGAGCGCCCCTGTCCGGCTGGCTGC 

GCGCGGGTTTTA-ATAGCATCTTTCGGACTTGTC7TCGCGGGCCCAGTCCCCGACCTCGG 

CGCTGCC7GGGC7CC7GCAGCC7C7CCC7AAG7C77C7CCAAACGACCACC7CACGGA77 

CCTTA7GGA7CGCAGC7CCAAGAGGAGGCAGG7GAAGCC777GGCAGC77C7C7GC7GG- 

AGCTC77GA77A7GA7AG77CAGA7GACAG7GA7777AAAG77GGAGA7GCC7CAGGAC7 

CGC7GAT7C77GAGAAGAG7CAAAAC7GGAGC7C7CAAAAAA7GGACCATATTC7GA777 

GCTGTG777G7C7GGGAGA7AA7AG7GAGGACGC7GA7GAAA7AA77CAG7G7GACAA77 

G7GGCA77ACAG7CCA7GAAGG77G77A7GGAG77GA7GGAGAGAG7GAG7C7A77A7Gn 

G77CAGC77C7GAAA,nC7CCAC7GAACC77GG7777G7GA7GGC7G7AAA7G7GG7G777 

C7CCTAGC7G7GA.-.C7G7G7CC7.-_A7CAGGA7GG.A=,7777CAAGGAGACAGA7GC7GGA.i 

GATGGG77CA7A. 7G77 i G7GCCC7G7A7G77CC7GGAG7AGCC777GGAGA7A7TGACA 

AA77A.CGACCAG7AACAC7AACGGAAA7GAAC7A77CCAAA7A7GG7GCCAAGGAG7G7- 

GC7777G7GAAGACCC7CGC777GC7AGAAC7GGGG777GCA77AGC7G7GA7GCAGGGA 

7GTGCAGAGCC7A777CCA7G7GACC7GTGC7CAAAAGGAAGGTC7GCTTTCAGAGGCAG 

CGGCGGAAGAGGA7A7AGCAGA7CCATTCTT7GC77A77G7AAGCAACATGCAGA7AGG7 

TAGACAGAAAG7GGAAGAGAAAAAACTACTTGGCTCTACAG7CC7ATTGTAAAA7GTC77 

7GCAAGAGAGAGAGAAGCAAC7A7CACCAGAAGCACAGGCAAGGA7CAATGCCCGGC77C 

AGCA.G7A7CG7GCCAAAGCAGAAC7AGC7CGA7C7ACCAGACGCCAGGCC7GGG77CCA.- 

GGGAAAAA77GCCCAGACCAC7CACCAGCAG7GC77CAGC7A77CG7AAAC77A7GCGGA 

AAGCAGAAC7CA7GGGGA7CAG7ACAGA7A7C777CCAG7GGACAA77CAGA7AC7AG77 

C7AG7G7GGA7GG.-_i.GGAGAAAACA7AAGCAACCAGC7C7CAC7GCAGATTTTG7GAi.77 

A77A7777GAGAG.-«iA7A7GCGCA7GA7TCAAA77CAGGAAAA7A7GGCTGAACAAAAGA 

A7ATAJW\GA7A-.n77AGAG.AA7GAACAAGAAAAGC77CA7G7AGAA7ATAATAAGC7A7 

G7GAA7C7 . 7AGAAGAAC - AC AAAAC C 7 G AA7 G G AAAAC7 7 C G AAG 7 G AAG G AC AA G GAA 

7A7GGGC777AC7AGGCAGAA7CACAGGGCAGAA377GAA7A7ACCGGCAAT777GCGPG 
CACCCAAGGAGAGAAAACCAAG7AAAAAAGAAGGAGGCACACAAAAGACATCTAC-C77C 
C7GCAG7AC77TA7AG7TG7GGGAT77G7AAGAAGAACCA7GA7CAGCA7CT7C7777P~ 
7G7G7GA7ACC7G7A.^AC7ACA77ACCA7C77GGA7G7C7GGA7CC7CC7CTTACAAGG^ 
7GCC.AAGAAAGACCAAAAACAG77A77GGCAG7GC7CGGAA7G7GACCAGGCAGGGAGCA 
G7GACA7GGAAGCAGA7A7GGCCA7GGAAACCC7ACCAGA7GGAACCAAACGA7C-^GG- 
GGCAGA77A_-.GG.-^CCAG7GA.AA777G77CCACAGGA7G7GCCACCAGAACCCAA.GA-.GA 

i a. v.^unl^-.cn.^.h.jnGn.H-.no.-.oGnv.CA^CGAAGCTTCGTTCCTGA 

AACA7GAGGAAAGAG77CCTAGAGAGAGAAGACAAAGACAGTC7G7G7^GCAAAAGAAG~ 
CCAAGGC7G.AAGA777AA.GA-AC7GAA7G7GCaAC77GC.AiGGGAAC7GGAC-ACAA'-GAAA 
A7C77G7CAGG7G7GA7GAA7GCAGAC7C7GC7ACCA7777GGC7G777GGATCC7CC77 
7GAAAAAG7C7CC7AAACAGACAGGCTACGGA7GGA7A737CAGGAA7G7GA77C'' :, ~C-'" 
CT7CCAAGGAAGA7GAAAA7GAAGC7GAAAGAAAAAA7A7A7C7CAGGAGC^CAACA7*GG 
AACAGAAAAA7CCAAAGAAATAAAAGA7TT7C7G7AG7G77777GAAAAGTT7GCAGC7"" 
ATGTAATAGCAGATAAAA7TTCTAATTGTAAAATGT7AAA77GAGCGGCCGCGAA7TC-~ 
GCAGCCCGGGGGA7CCAC7AG77C7AGAGCGGCCGCCACCGCGG7GGAGCTCCAGC7 
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ATTAACCCTCACTAAAGGGAACAAAAGCTGGAGCTCCACCGCGGTGGCGGCCGCTCTAGA 

ACTAGTGGATCCCCCGGGCTGCAGGAATTCGGCACGAGGTGCGCGGCTGCAACGGCAGC 

GCGGGAAGCTCGGGCCGGCAGGGTTTCCCCGCACGCTGGCGCCCAGCTCCC^G^ 

gccgctgtaagtttcgctttccattcagtggaaaacgaaagc-Sggcg^ 

CGCGGGGCCAGACCAAGGCGGGCCCGGAGCGGAACTTCGGTC^CAGC^-G--"--— 

CAGTGCCGAGGTGGAACTCAGCAGCGGAGGCTGGACGCTTGCATGGCGCTTGAGAG^T^'" 
CATCGTGCCTGGCTCACATAAGCGCTTCCTGGAAGTGAAGTCGTGCTGTCC^GaVcG^- 
GGCAGGCAGCTGCGGCCTGGGGGTTTTGGAGTGATGAGGAA7GAGCAAGGGG-—GGGC- 
CCTGAGGCAAATCTGTCAGTCCATCC7GGCTGAGTCCTCGCAGTCCCCGGCAGA~CT?G* 
Ao^GAAGGAAG.^AGACAGCAACATGAAGAGAGAGCAGCGCAGAGAGCGTCCC^GGGC 
^;JS CTACCCTCATGGCCTGGTTGGTTTAC ACAACAT7GGACAGACCTGCTGCCTT^ 
G I G ^;i" ATTCAGGTGTTCGT ^ TG ^ TG "GGACTTCACCAGGATAT7GAAGAGGATCAC 
GG7GCCCAGGGGAGC7GACGAGCAGAGGAGAAGCG7CCC777GCAGA7GC— C-GC-GC' 
CSGAGAAGATGCAGGACAGCCGGCAGAAAGC^GTGCGGCCCCTGGATCTGGCCTACTGCC' 
GCAGAAGTGCAACG7GCCC77G777G7CCAACA7GA7GC7GGCCAAC7GTACC T CAAA^'~ 

CTGGAACCTGATTAA«3ACCAGATCACTGATGTGCACTTGG7GGAGAGACTGCAGGCCCT 
G7A7A7GA7CCGGG7GAAGGACTCC77GAT77GCG77GAC7G7GCCA7GGAGAG-iGCA : 

AAACAGCAGCATGCTCACCCTCCCACTTTCTCTTTTTGATGTGGACTCAAAGCrcc^GA? 

GACACTGGAGGACGCCCTGCACTGCTTCTTCCAGCCCAGGGAGTTATCAAGCAA^^GCAA 

GTGCTTCTGTGAGAACTGTGGGAAGAAGACCCGTGGGAAACAGGTCTTGAAGCT-GACCC- 

TTTGCCCCAGACCCTGACAATCCACCTCATGCGATTCTCCATCAGGAATTCACAGACG^-:' 

AAAGATCTGCCACTCCCTGTACTTCCCCCAGAGCTTGGATTTCAGCCAGA^C- ^2- 

GAAGCGAGAG7C77G7GA7GC7GAGGAGCAGTC7GGAGGGCAG7A7GAGC~-7-TGC^G : - 

GA7TGCGCACG7GGGAA7GGCAGAC7CCGG7CA77AC7G7G7CTACA7CCGGAA'~G r "~'G"" 

C-GA7GGAAAA7GG77C7GC77CAA7GAC7CCAA7A777GC77GG7G7CCTGGGAAgPa- : 

CCAG7G7ACC7ACGGAAA7CC7AAC7ACCAC7GGCAGGAAAC7GCA7AT C -7C^GG7 :;: - : 

CATGAAGATGGAGTGCTAATGGAAATGCCCAAAACCTTCAGAGATTGACArGC-G'C^" 
-TCCATTTCCGTTCCTGGATCTACGGAGTCTTCTAAGAGATTTTGCA^TGic^i-aa-" - 

T7G7777CAAAC7A7A7AAC7GAGCC77AT77A7A.^77AGGGA7A77A7ckAAi-V- : G-' : 
ACCA7GAGGCCCC7CAGG7CC7GA7CAG7CAGAA7GGA7GC7~7CACCAGC^:;'— ~~ 

^ZtUt ' - CGG 1 CC 1 GGG 1 GC7CGC7GC7G7GCAAGACA77AGCCC77TAG~"^~': 
AGGC7 G7GGGAAC7 T C AGGGG77 CCC AGTGGGGAGAGC AG7GGC AG7GGG" "~~-~C^'~'~ 
GGGCCAAAGG7CAG7GGCAGGGGG7A777CAG7A77A7ACAAC7GC7G7GAC^AGAC'^~^ 

.A7r-.C7GGC7GAA7A7CAG7GC7G777G7AA77777CAC777GAGAACCAACi-~AZi-- : - 
GA7A7 G AAAAAAAAAAAAAAAAAAA * " ~ 
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GCGGCCGCTC7AGAAC7AG7GGA7CCCCCGGGC7GCAGGAA77CGCGGCCGC7AAA7GAA 

CTCCCATAAGAGTCTAGACACCATAGAACTCATACCAGGAATCACAAAGTCTCTAAATTT 

CCAAAGTTAACTGGAAATATTACAAACTGCAGAATAATTCCAGGCCAAAATATGTTAAA^ 

TCATAACATGATGTATATCAAAGGAAAAAAGGACATGTGGAAATGACACATTATC7TCAG 

TGTATAAAATATTCATTTATGTGAAGTTTCTTGGAAAGGCTACACTACTATTACTGGT7T 

CCGTCTGATGTTTGAGATCTGTTGATTTTATGCTTTTCTTACAGGCCTTTCATTATGATC 

TTTGGGAAGGAATCAATAAAATGATAGGGCCTACTTCATTAGGTGTGGTTCATTCCTAT7 

CATGC7CCCTGGAAGAACAAGAATGCTGAATTTTGAAATTTAATATTGTATGAATTAGCA 

7CAGGGAGAGG7GGAGAAAAA7ACAAAAC7AAAAG7CA7GC77A77G7G77CAG7G7GCC 

C77C7CCAGAGGGCCAC7GGC77A7AGGAAAGGA77GC7GC7C7ACCAG77GACCAGGAG 

A7GGCACGCCAGGACA77AAGACAC7GGAG7777G777CG77777777777777777GAG 

ATGGAG7CTCGC7C7C77GACAGGCAGGAG7ACAG7GG7GCGA7C7CGGC7CAC7GCAAA 

C7CCGCC7CCCGGG77CAAG7GA77C7CC7GCC7CGGCC7CCCGAG7AGC7GGGAC7ACA 

GGCG7G7GCCACCACCCCCAGC7AAC7777G7A77777AG7AGAGACAGGG777CACCA7 

GT7GGCCAGGA7GG7C7CAA7C7CTTGACC7CA7GA7CCGCCCGCC7CCGCC77CCAAAG 

7GC7GGGAT7ACAGGCG7GAGCCAGTGTGCCCGGCCGACAC7GGGC77777A7GAGAG7G 

ACAGA77AC7AGGACC7CA77A7GTGGTAGAAG7AA7G7AGGGGAAA7GGCGA77A7C77 

TT777AAAAGCAA7AGC7G77G7ATA7CAA7GA7AAA7GAAAAA77AG77A77C77G7AA 

ATTGAAGAAAGAA7GG77A7CA7AGAGGG7AGTTCAAG7AAAAGAACCAGGGC7GGG7G7 

GG7GGC7CACG7TC7G7AA7CCCTGTAC777GGGAGGCCAAGGCAGA7GGA7C7C77GAG 

GCCAGGAGT7CGAGACCAGCC7GACCAACA7GGCAAAACCG7G7C7C7ACAAAAAATACA 

AAAA77AGCCGGACA7CG7GG7AGA7GCC7G7AG7C7CAGA7A77CAGGAGACCGAGGGG 

AAAA7CACT7GAACCCGGGGGACGGAGG77GCAG7GAGC7GAGA7CGCACCAC7GC7CGC 

CAGCC7GGGCAACAGAG7GAGAC7C7GCC7CAAAAAAAAACCAAACCAAACCAAAGAACC 

AGAA7AGCA7G7GCACA7A7ACACAGACG777CACAAC7GGCA77A7G7777GC7AC7G7 

777A777ACAA7G7A7CACAAG7777ATGC777AA7AAAA777AA7CA7AAC77CAAAAA 
AAAAAAAAAAAAAAAGCGGCCGCGAA77 
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GAAT7CGGCACGAGGAAAACA777GCCCC77GCAGAAGA7CACCC77AG77C77CC T CG^ 
AAGAG7A7CAGAAGG7C7GGAACC7C777AACCGCACGC7GCC777C7ACT?7G7~C^GA 
AGATTGAGCGAGTACAGAACCTGGCCCTCTGGGAAGTCTACCAGTGGTGCGTTGGGGCTC 
GCTCTTGGTGGGCTGGTGACTCTGTCCCTTCACACCACTGGCTGGTTGCCACA7G T GGC" 
CGGGTTTCCAGGAAAAGCAGAGCGGCAGTTAGGGCTGCCATGTGCTGGGAGCTG-G T G T C 
TGCTCTCCTTCGTCCGCTCCCCCAGGGCAGTGTGGTAGCACATCCCATTGTAGAGPTGAG 
GGCACCGAGGCTTCCTGGAGCATACCACCTGGTCCCGTTCATGAGTGGTGGCAA.-GC" r A- 

CAC7C7CAC77G7CCA77C7GCC77CC7GGAGACCAG7GGGA7GGG7CAG7ACAGCCCAC 

CACACCATTAGCCCCAGGAACATAAGGCTGTGGCTAGACAGCAGGGGTCTCAGGTTCATA 

CATGAGGACTGGCTTGTCCTTGAGCACCCACTCACCTGTCTATGTGGGGAGGAA7CCTAC 

AATAGGTCACCA7GGCAGGCTGGGTCTTGCTGACCTGTCCCCAGATGGGGTTGGGG7AG7 

GTAATGTGTACTCTGTGCACAGTGATGAAGTCTGGGAATGGGAGAGGGGAGAAGGATGGG 

CACCCACTGACCAGCAGCCTGAAAATTCCTACAGCATCCCAGGGCTCAGCTCCATGCAGG 

AGCAAGG7GGGGG7GGGG77GGGGGAAA7G77ACCCA7777CCAAGGGC7GCTC~G-''~ — 

7GGAG7CCAGGGAACCGC7GC7G7C7GGAGC7G7GGAGGGAGGG777TCACCCAGC^CCC 

ACGA7CCCCC77C7777CCACACCC7GGC77G7GGC7GGAGCC77ACAGGCC7AG7CAGG 

GTAGCC7G7GACC7GCG7C7C77GG7CCCAGGACAC7TT7GGAA777TGGAAAAA7G^"G' T, 

TGTTTTGCATCAGGCCGGCTGTATTTGGTGGCCGGCACACTCTGCCCCCAGCACACATTC 

TTCTG7GAT7C7AGGCAAAAAGGACAGA7GCAGAAGCAGAACGGAGGGAAGGCCG7GGAC 

GAGCGGCAGC7G77CCACGGCACCAGCGCCA77777G7GGACGCCA7CTGCCAGCAGAAC 

TTTGACTGGCGGG7C7GTGG7G77CATGGCAC7TCCTACGGCAAGGGGAGCTACT7TGCC 

CGAGATGCTGCATATTCCCACCAC7ACAGCAAATCCGACACGCAGACCCACACGATGT7C 

CTGGCCCGGG7GC7GG7GGGCGAG77CG7CAGGGGCAATGCC7CC77TG7CCG7CCGCC3 

GCCAAGGAGGGCTGGAGCAACGCCTTCTATGATAGCTGCGTGAACAGTGTGTCCGACCCC 

TCCATCTTTGTGATCTTTGAGAAACACCAGGTCTACCCAGAGTATGTCATCCAGTACACC 

ACCTCCTCCAAGCCC7CGGTCACACCCTCCATCCTGCTGGCCTTGGGCTCCCTGTTCAGC 

AGCCGACAG7GAGCGCACAGGAG7G77CCAGGCC777CACC7GC7C7GCC77GAAA7GGC 

TATT7GGGCC777CC7777CT7777AAACAGAAAC77TTAA7GAAC7G77C7C77AACA7 

7GACC7C7CAA7GAAG77A7G77C77AA7C7C77GC7AA7AA7GA77T77AC7777AAG7 

CACT77TGGG77CAC7AG7GGA77AACCAGAAG7GA77G7AG77GAG7CCAG7777GC — 

TTTAA7AA7G7G77GAA37777AG777T7AC7C777G77GAC777GCTGC77A773GCAC 

CAGGGACAGAG777C7AGA7ACAA7777A7GGA77GG7777AAT777TA7GAG77TG7C" 

CTGCAG7GA7TCGG777C7CAGAG7C7CA7GGCA7CATAG777T7CCAGAA7GACACAG7 

AGCCACCGG7GGATGACAGCCCACGGGCGGCACAG7CAC77C7GCC7G77GCTC7GACAC 

CAACCCAGGCAGC7C7GC7G7GGC77C7CC7GGGC7C7GGCA77AG77GG7C7G7G7C.-.3 

A7TG7CAGAACAGGTGGC7GC7G7GTGG7GCCA7CGAG7CCC7GCTGG77CCCC77G7CC 

7GGGAGGG7CACCCA77GCCCAAGGAAGTGCA7CCACC7GGCAGG7GACC7GGAGGAG7A 

GC77CCCCGAGGACCCCCAGGC77GGCC7G7GA77GCGCAAACCCACA77TCC7AAGCAC 

ACTGGACACCC77CGAG7G7GGG7777AACA7CCC7G7GAGA77GAA7AC77G7GCCACA 

CATG7CACAAAAGAG7A7GGAAA7AAAAGAAAA777ATCCGAAAAAAAAAAAAAAAAAA - 
GAGCGGCCGC 
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CCGGTCTATGGCATTAACCCTCACTTAACTTTTCAGCCTGCCAGCCTGCCCTATGGATTT 
CGGACTTGCCAGCCACACAATTCCTTAAAATAAATCTCTCCGTCTCATAAAAA 
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CCGGGAGCTGTGAAGGGAACGTGAGGGGGCGGCGTAGTGGAGACCCACGGCAGGCCTGAA 

GAAGAGCGGCGGCCGAGCCCGCCTTCCCTGCACCATGCTCATAGAGGATGTGGATGCCCT 

CAAGTCCTGGCTGGCCAAGTTACTGGAGCCGATATGTGATGCTGATCCTTCAGCCTTAGC 

CAACTATGTTGTAGCACTGGTCAAGAAGGACAAACCTGAGAAAGAATTAAAAGCCT7TTG 

TGCTGATCAACTTGATGTCTTTTTACAAAAAGAAACTTCAGGTTTTGTGGACAAACTATT 

TGAAAGTCTCTATACTAAGAACTACCTTCCACTTTTGGAACCAGTAAAGCCTGAGCCAAA 
ACCACTAGCCCAAGAAAAA 
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ATGGATAGTCGCCACACCTTTGCCCCTGCTGCGATGACCCTGTCGCCACTTCTGCTGTTC 
CTGCCACCGCTGCTGCTGCTGCTGGACGTCCCCACGGCGGCGGTGCAGGCGTCCCCTCTG 
CAAGCGTTAGACTTCTTTGGGAATGGGCCACCAGTTAACTACAAGACAGGCAATCTATAC 
CTGCGGGGGCCCCTGAAGAAGTCCAATGCACCGCTTGTCAATGTGACCCTCTACTATGAA 
GCACTGTGCGGTGGCTGCCGAGCCTTCCTGATCCGGGAGCTCTTCCCAACATGGCTGTTG 
GTCATGGAGATCCTCAATGTCACGCTGGTGCCCTACGGAAACGCACAGGAACAAAATGTC 
AGTGGCAGGTGGGAGTTCAAGTGCCAGCATGGAGAAGAGGAGTGCAAATTCAACAAGGTG 
GAGGCCTGCGTGTTGGATGAACTTGACATGGAGCTAGCCTTCCTGACCATTGTCTGCATG 
GAAGAGTTTGAGGACATGGAGAGAAGTCTGCCACTATGCCTGCAGCTCTACGCCCCAGGG 
CTGTCGCCAGACACTATCATGGAG7GTGCAATGGGGGACCCCGGCATGCAGCTCATGCAC 
GCCAACGCCCAGCGGACAGATGCTCTCCAGCCACCACACGAGTATGTGCCCTGGGTCACC 
GTCAATGGGAAACCCTTGGAAGATCAGACCCAGCTCCTTACCCTTGTCTGCCAGTTGTAC 
CAGGGCAAGAAGCCGGATGTCTGCCCT7CCTCAACCAGCTCCCTCAGGAGTGTTTGCTTC 
AAGTGATGGCCGGTGAGCTGCGGAGAGCTCATGGAAGGCGAGTGGGAACCCGGCTGCCTG 
CCTTTTTTTTCTGATCCAGACCCTCGGCACCTGCTACTTACCAACTGGAAAATTTTATGC 
ATCCCATGAAGCCCAGATACACAAAATTCCACCCCATGATCAAGAATCCTGCTCCACTAA 
GAATGGTGCTAAAGTAAAACTAGTTTAATAAGCAAAAAAAAAA-AAAAAAAATTCCTGCG 
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CCGGGATATCGCCACTGCACTCCAGCCTGGGTGACGGAGCGAGACTCCGTCTCAGAAAAA 
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MHWEEAQI SRAVLSLPRI DLCVSPNKLTYS PK 
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MEFNTTHYREFGPRGQEFGTRQQQQQKKTEHLHITDTQFKKQNITAPSRIFLGSLPSLLT 
PDYKQPPPISPDIVLYESSSSQMGLFCPLGTLGSIWRH* 
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MIVQMTVILKLEMPQDSLILEKSQNWSSQKMDHILICCVCLGDNSEDADEI IQCDNCGIT 

VHEGCYGVDGESDSIMSSASENSTEPWFCDACKCGVSPSCELCPNQDGI FKETDAGRWVH 

IVCALYVPGVAFGDIDKLRPVTLTEMNYSKYGAKECSFCEDPRFARTGVCISCDAGMCRA 

YFHVTCAQKEGLLSEAAAEEDIADPFFAYCKQHADRLDRKWKRKNYLALQSYCKMSLQER 

EKQLSPEAQARINARLQQYRAKAELARSTRPQAWVPREKLPRPLTSSASAIRKLMRKAEL 

MGISTDI FPVDNSDTSSSVDGRRKHKQPALTADFVNYYFERNMRMIQIQENMAEQKNIKD 

KLENEQEKLHVEYNKLCESLEELQNLNGKLRSEGQGIWALLGRITGQKLNIPAILRAPKE 

RKPSKKEGGTQKTSTLPAVLYSCGICKKNHDQHLLLLCDTCKLHYHLGCLDPPLTRMPRK 

TKNSYWQCSECDQAGSSDMEADMAMETLPDGTKRSRRQIKEPVKFVPQDVPPEPKKIPIR 

NTRTRGRKRSFVPEEEKHEERVPRERRQRQSVLQKKPKAEDLRTECATCKGTGDNENLVR 

CDECRLCYHFGCLDPPLKKSPKQTGYGWICQECDSSSSKEDENEAERKNISQELN M EQKN 
PKK 
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14/23 

Name: I MX 28 

MSKAFGLLRQICQSILAESSQSPADLEEKKEEDSNMKREQPRERPRAWDYPHGLVGLHNI 
GQTCCLNSLIQVFVMNVDFTRILKRITVPRGADEQRRSVPFQMLLLLEKMQDSRQKAVRP 
LELAYCLQKCNVPLFVQKDAAQLYLKLWNLIKDQITDVHLVERLQALYMIRVKDSLICVD 
CAMESSRNSSMLTLPLSLFDVDSKPLKTLEDALHCFFQPRELSSKSKCFCENCGKKTRGK 
QVLKLTHLPQTLTIHLMRFSIRNSQTRKICHSLYFPQSLDFSQILPMKRESCDAEEQSGG 
QYELFAVIAHVGMADSGHYCVYIRNAVDGKWFCFNDSNICLVSWEDIQCTYGNPNYHWQE 
TAYLLVYMKMEC* 



FIGURE 14 
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Name: I MX 32 

MAAALLPSGQNWHNTGFILESNLTNVMKWRLFIKCPCLWGHEKIHTESIKNVLNMERPL 
SNSDVMKWVF* 



FIGURE 15 
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16/23 

Name: I MX 39 

MCTLCTVMKSGNGRGEKDGHPLTSSLKI PTASQGSAPCRSKVGVGLGEMLPI FQGLLCFW 
SPGNRCCLELWREGFHPAPTIPLLFHTLACGWSLTGLVRVACDLRLLVPGHFWNFGKMCC 
FASGRLYLVAGTLCPQHTFFCDSRQKGQMQKQNGGKAVDERQLFHGTSAIFVDAICQQNF 
DWRVCGVHGTSYGKGSYFARDAAYSHHYSKSDTQTHTMFLARVLVGEFVRGNASFVRPPA 
KEGWSNAFYDSCVNSVSDPS I FVI FEKHQVY PEYVIQYTTSSKPSVTPS I LLALGSLFSS 

RQ 



FIGURE 16 
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Name: I MX 40 

MPVYGINPHLTFQPASLPYGFRTCQPHNSLK 



FIGURE 17 
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18/23 

Name: I MX 42 

MLIEDVDALKSWLAKLLEPICDADPSALANYVVALVKKDKPEKELKAFCADQLCVFLQK^ 
TSGFVDKLFESLYTKNYLPLLEPVKPEPKPLAQEK* 



FIGURE 1 S 
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19/23 

Name: I MX 44 

(MD) SRHTFAPAAMTLSPLLLFLPPLLLLLDVPTAAVQASPLQALDFFGNGPPVNYKTGN 

LYLRGPLKKSNAPLVNVTLYYEALCGGCRAFLIRELFPTWLLVMEILNVTLVPYGNAQEQ 

NVSGRWEFKCQHGEEECKFNKVEACVLDELDMELAFLTI VCMEEFEDMERSLPLCLQLYA 

PGLSPDTIMECAMGDPGMQLMHANAQRTDALQPPHEYVPWVTVNGKPLEDQTQLLTLVCQ 
LYQGKKPDVCPSSTSSLRSVCFK 



FIGURE 19 
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20/23 

Name: I MX 56 

MPGYRHCTPAWVTERDSVSEK* 



FIGURE 20 
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Match to •Tnr s on PAC clone that carries ApoL genet 
>emb|Z822X5 (HS6802 Bono itpitai DHX sequence from PAC 6802 on 
chromosome 22 . 

Contains apolipoprotein h, myosin heavy chain, XSTs, CX 

repeat, STS 

and GSS, complete sequence [Homo sapiens} 
Length - 139,389 



Match of Ouexyi 2 

AOGAGCI'UICIUUX IATTATACACACGCATAACTGGAGGTGGGATCCACACAGCTCAGAA 61 

I I 

Ml III I I I I I I II I I 1 I Ml I I Mill il MM Ml I II III III M II II M 

Sbjct i 2X414 

AGGACC^LnvrCTCGTTATTATACAQACGCATAACTO 21473 
Query i 62 CAGCTGGAT Cri ' UCl ' C A G rCIVI\^ CAGOCGAAC A ' r ' l * CC , ri , U G 104 

MINI Mill MIIIMIIIIMIIIIMIIII lllllll M 

Sbjct z 2X474 CAGCTGGATCTTGCTCACrrCTCTCCCAGGGGAAGATTCCTTCG 21516 



Query s X03 

<WAGGAgCCCCTCCAGCraCAT<raAGaa^ 162 

CQAGGASGCCCTGCAGCGACATGQACGGAOCT CC ' riU^C TO 
Sbjctx 23232 

(aXAGQAGGCCCTGCAGCGACATG<^^ 23291 

Query i X63 TCTGGATGAG X72 

TCTGG TQAO 
Sbjct: 23292 TCTGG-TGAG 23300 

Query s X2X ACATGGAGGGAGCTGCrWGCTGAOAGTCTCTGTCCT- 
CTGCATCTGGATGAGTGCACrr X79 

A ATOG G T CT TO T A TCTC TC T C CA C 
GGXTGAOTGCACTT 

Sbjct: 25333 AGATGGCTGCCCGTCCTCTGATTATCTTCTCC- 
TGATACCCCAACAGGATGAGTGCACTT 25391 

Query: X80 TTCCTTGNTNTNOHAHTHANGNCJtGAGGAAGCK 221 

TTCCTTG T T G A T A G CAQACQAAGCTGGAGCGAOG 
Sbjcti 25392 TTCCTTGGTGTGGGAGTGAGGGCAGAGGAAGCTGGAGCGAGG 25433 



Query i 217 

CGAGGGTGCAACAAAACGTTCCAAGTGGGACAGATACTGGAGATCCTCAAAGTAAGCCCC 276 

I 

I f 1 1 1 1 1 I I 1 I 1 !! I I f I f 1 1 1 I I 1 1 ! 1 1 1 1 i 1 1 1 1 1 1 1 ! 1 1 I i 1 1 1 1 1 1 f 1 1 1 1 1 1 1 

Sbjct t 256XX 

CAAGGGTGCAACAAAACGTTCCAAGTGGGACAGATACTGGAGATCCTCAAAGTAAGCCCC 25670 
Query: 277 TCGGTGACTGGGCTGCTGGCACCATGGACCCAGAGAG 3X3 

t ! I I 1 1 1 i I 1 I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I J || 

Sbjct: 2567X TCGGTGACTGGGCTGCTOGCACCATGGACCGAGGTAG 25707 
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Query: 3 04 

^CCCAGAGAGCAGTATCTTTATTGAGGATGCCATTAAGTATTTCAAGGAAAAAGTGAGCA 3 6 3 

!1 ] N M I M I I I I I I I I ! I i M I ! ! I ! I I ! I I i I I I M f 1 1 I ; I ! | 1 1 ! ! i | | i ! 

Sbjct: 29886 

AACTAGAGAGCAGTATCTTTATTGAGGATGCCATTAAGTATTTCAAGGAAAAAGTGAGCA 2 9 94 5 
Query: 3 64 

CACAGAATCTGCTACTCCTGCTGACTGATAATGAG<3CCTGGAACGGATTCGTGGCTGCTG 423 

II ! N I I I I I 1 1 M I I ! I ! I I I M M I ii M 1 1 I I I ! I I i I 1 1 1 I I I I M I i II I I I I I I 

Sbjct: 29946 

^^^^^^^^^^^TACTCCTGCTGACTGATAATGAGGCCTGGAACGGATTCGTGGCTGCTG 3 00 0 5 
Query: 4 24 CTGAACTGCCCAGGAATG 441 

M I M 1 1 i ! II I I I I I 

Sbjct: 30006 CTGAACTGCCCAGGTAAG 30023 
Query: 430 

'^^^^^^''^T^A^^CAGATGAGCTCCGTAAAGCTCTGGACAACCTTGCAAGACAAATGA 4 8 9 

TG 

CAGGAATGAGGCAGATGAGCTCCGTAAAGCTCTGGACAACCTTGCAAGACAAATGA 
Sbjct: 33440 

TGTGCAGGAATGAGGCAGATGAGCTCCGTAAAGCTCTGGACAACCTTGCAAGACAAATGA 3 34 9 9 
Query: 490 

TCATGAAAG ACAAAAACTGGCACGATAAAGGCCAGCAGTACAGAAACTGGTTT CTGAAAA 54 9 

TCATGAAAGACAAAAACTGGCACGATAAAGGCCAGCAGTACAGAAACTGGTTTCTGAAA 
Sbjct: 33500 

TCATGAAAG ACAAAAACTGGCA CG AT AAAGGCCAGCAGTACAGAAA CTGG TTT CTG AAA - 3 3558 
Query: 550 

GAGTTTCCTCGGGTGAAAAAGTAAGCTTGAGGATAACATAAGAAAGCTTCCGTGCCCTTG 6 0 9 

GAGTTTCCTCGG TGAAAA GT AGCTTGAGGATAACATAAGAA GCT 



Sbjct: 33 559 G AGTTTCCTCGGTTGAAAA - GTGAGCTTGAGGATAACATAAG AAGGCT - 
CCGTGCCCTTG 3 3 616 

Query: 610 CAANATGGG 618 

CA ATGGG 
Sbjct: 33617 CA - G ATGGG 33624 
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SEQUENCE LISTING 



<110> Baum, Peter 

DuBose, Robert 
Sims, John E 
Youakim, Adel 
Hasel, Karl w 
Hilbush, Brian S 

<120> Novel DNAs and Polypeptides 
<130> 98,664-A 



<140> 

<141> 1999-11-10 

<150> 60/107821 

<151> 1998-11-10 



<160> 38 



<170> Patentln Ver. 2.1 

<210> 1 
<211> 137 
<212> DNA 

<213> Homo sapiens 



<400> 1 



ccggtaagta aacagtcaga aaattagcat gaaagcagtt tagcattggg aggaagcaca 60 
gatctctaga gctgtcctgt cgctgcccag gattgacctg tgtgtaagtc ccaataaact 12 



0 

137 



cacctactca ccaaaaa 

<210> 2 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 2 

cggttcttga gcccagtaga tgccatttga agaaaaaaat cacttgaaaa tgagacagaa 60 
agaatggaaa ctaaatccta gctctaaagg caccaggctg attaaaaa i 0 8 

<210> 3 
<211> 306 
<212> DNA 

<213> Homo sapiens 



<400> 3 

cggcttcagc agtatcgtgc caaagcagaa 
gttccaaggg aaaaattgcc cagaccactc 
atgcggaaag cagaactcat ggggatcagt 
actagttcta gtgtggatgg aaggagaaaa 
gtaaattatt attttgagag aaacatgcgc 
caaaaa 



ctagctcgat ctaccagacc ccaggcctgg 60 
accagcagtg cttcagctat tcgtaaactt 120 
acagatatct ttccagtgga caattcagat 180 
cataagcaac cagctctcac tgcagatttt 240 
atgattcaaa ttcaggaaaa tatggctgaa 300 

306 



<210> 4 

<211> 366 

<212> DNA 

<213> Homo sapiens 



i 
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PCT/US99/26788 



<400> 4 

cggccatgtg 

gttatgagcc 

atctgggggc 

gacttgtata 

taattccata 

gatcatctct 

caaaaa 

<210> 5 
<211> 324 
<212> DNA 
<213> Homo 

<400> 5 

cggccgacac 

agtaatgtag 

gataaatgaa 

gttcaagtaa 

gggaggccaa 

ggcgagaccg 



gytgctcggt 

tgtgggaact 

caaaggtcag 
ctggctgaat 
cgaatcaagt 
tctccataag 



cctkggttgc 
tcaggggttc 
tggcaggggg 
atcagtgctg 
gttttgtaac 
atagtgtgat 



tcgcttgctg 
ccagtgggga 
tatttcagta 
tttgtaattt 
tgctattcat 
aaacacagtc 



tgcaagacat 
gagcagtggc 
ttatacaact 
ttcactttga 
ttattcagca 
atgaataaag 



tagcccttta 
agtgggaggc 
gctgtggcca 
gaaccaacat 
aatatttatt 
ttatttccac 



sapiens 



tgggct tttt 

gggaaatggc 
aaattagtta 
aagaaccagg 
ggcagatgga 
tgtctcccca 



atgagagtga cagattacta ggacctcatt atgtggtaga 
gattatcttt ttttaaaagc aatagctgtt gtatatcaat 
ttcttgtaaa ttgaagaaag aatggttatc atagagggta 
gctgggtgtg gtggctcacg ttctgtagtc cctgtacttt 
tctcttgagg ccaggagttc gagaccagcc tgaccaacat 
aaaa 



60 
120 
180 
240 
300 
360 
366 



60 

120 

180 

240 

300 

324 



<210> 6 
<211> 398 
<212> DNA 

<213> Homo sapiens 



<400> 6 

cggtggatga cagcccacgg gcggcacagt 

aggcagctct gctgtggctt ctcctgggct 

agaacaggtg gcctgtgtgg tgccatcgag 

tcacccattg cccaaggaag tgcatccacc 

gaggaccccc aggcttggcc tgtgattgcg 

cccttcgagt gtgggtttta acatccctgt 

aaaagagtat ggaaataaaa gaaaatttat 

<210> 7 

<211> 113 

<212> DNA 

<213> Homo sapiens 



cacttctgcc tgttgctctg acaccaaccc 60 

ctggcattag ttggtctgtg tcacattgtc 120 

tccctgctgg ttccccttgt cctgggaggg 180 

tggcaggtga cctggaggag tagct tcccc 24 0 

caaacccaca tttcctaagc acactggaca 300 

gagattgaat acttgtgcca cacatgtcac 360 

ccgaaaaa iqa 



ccggtctatg gcattaaccc tcacttaact tttcagcctg ccagcctgcc ctatggattt 60 
cggacttgcc agccacacaa ttccttaaaa taaatctctc cgtctcataa aaa n 



<210> 8 
<211> 379 
<212> DNA 

<213> Homo sapiens 
<400> 8 

ccgggagctg tgaagggaac gtgagggggc ggcgtagtgg agacccacgg caggcctgaa 60 
gaagagcggc ggccgagccc gccttccctg caccatgctc atagaggatg tggatgccct 120 
caagtcctgg ctggccaagt tactggagcc gatatgtgat gctgatcctt cagccttagc 180 
caactatgtt gtagcactgg tcaagaagga caaacctgag aaagaattaa aagccttttg 240 
tgctgatcaa cttgatgtct ttttacaaaa agaaacttca ggttttgtgg acaaactatt 300 
cgaaagtctc tatactaaga actaccttcc acttttggaa ccagtaaagc ctgagccaaa 360 
accactagcc caagaaaaa 

379 



10 



20 



50 



55 



60 
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<210> 9 
<211> 180 
<212> DNA 

<213> Homo sapiens 
<400> 9 




<210> 10 

15 <211> 60 

<212> DNA 

<213> Homo sapiens 



<400> 10 

ccgggatatc gccactgcac tccagcctgg gtgacggagc gagactccgt ctcagaaaaa 60 



<210> 11 
<211> 
25 <212> DNA 

<213> Homo sapiens 

<400> 11 

1 tcaatcctgg gcggcgacaa gacagctcta gagatctgag cttcctccca 
51 ?" ttCat9 ° taatttt "9 actgtttact taccgggtaa 

< 

451 tcctgtatcc aactgttcta actcttgggc tttctccatt ttcagctctt 



101 



201 
251 
35 301 
351 



- = — — ww^^w-^y a^Ly L.LLQL.L tdccgggcaa 

gagcgatggg actgttttca ttggttggtt ctcacatact ctctgggaag 
tttgggttct cagggacacc tgctcctcag ctggggacca tggccatggc 
ccaccacctg cccttcagtg ttcaagcagg ggacatgcac cctttagtaa 
cctggagggg acccatcaca tgacaaccac cccaacgacc atcatcagga 
agccgctgcc tgactgagat atgcccccag gaggacaagg gagagtggat 
gctggaaaga cagggcaggg gaccatcacc agggaaagac ttcattcttc 



501 tcttttcctg gccttctcat tgctggntcc ttcaagcctc cnctctatnc 
4U 551 ttccgncaat atattcttt 



<210> 12 
<211> 

45 <212> DNA 

<213> Homo sapiens 



<400> 12 






1 


cggcgacaag 


acagctctag 


51 


ctttcatgct 


aattttctga 


101 


ctgttttcat 


tggttggttc 


151 


agggacacct 


gctcctcagc 


201 


ccttcagtgt 


tcaagcaggg 


251 


cccatcacat 


gacaaccacc 


301 


gactgagata 


tgcccccagg 


351 


agggcagggg 


accaccacca 


401 


aacctggggc 


tgggtctgta 


451 


actgttctaa 


ctcttgggct 


501 


ccttctcatt 


gctggttcct 


551 


tattcttttt 


tttttttttt 


601 


agctggagtg 


ca 



3 
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<210> 13 
<211> 
<212> DNA 

<213> Homo sapiens 

<400> 13 

1 cacgagctgt ctggttatta 

51 acagctcaga acagctggat 

101 ttggaggagg ccctgcagcg 

151 ctgtcctctg catctggatg 

2 01 gcagaggaag ctggagcgag 

251 tactggagat cctcaaagta 

301 tggacccaga gagcagtatc 

351 gaaaaagtga gcacacagaa 

4 01 ctggaacgga ttcgtggctg 

451 agctccgtaa agctctggac 

501 aaaaactggc acgataaagg 

551 agtttcctcg ggtgaaaaag 

601 gtgcccttgc aanatggg 



tacagacgca taactggagg tgggatccac 
cttgctcagt ctctgccagg ggaagattcc 
acatggaggg agctgctttg ctgagagtct 
agtgcacttt tccttggtgt gggagtgagg 
ggtgcaacaa aacgttccaa gtgggacaga 
agcccctcgg tgactgggct gctggcacca 
tttattgagg atgccattaa gtatttcaag 
tctgctactc ctgctgactg ataatgaggc 
ctgctgaact gcccaggaat gaggcagatg 
aaccttgcaa gacaaatgat catgaaagac 
ccagcagtac agaaactggt ttctgaaaag 
taagcttgag gataacataa gaaagcttcc 



<210> 14 
<211> 1044 
<212> DNA 

<213> Homo sapiens 



<400> 14 

gaatttaata cgactcacta tagggaattt 
caacaacaac aacaaaaaaa aactgaacat 
caaaatatta cagccccttc caggatattc 
gactacaaac agcctccacc tatttcacct 
cagatggggc tattttgccc cctggggaca 
tgtctacttg gggggagttg tgttactgca 
ctcaacatcc tgaaatgcac agggaacccc 
aaatgtcagc agtgtcacag ctgacaccct 
tttcgggctc aggatctttt tcattctaat 
aggtaggtac agtccacaac aaagaagaac 
ggtgccttta gagctaggat ttagtttcta 
ttcttcaaat ggcatctact gggctcaaga 
atgggaattt tgtacacacc cacacaggta 
cacagataca cacatccgga gaccaagaca 
ccccaaacag gagaaacgca ccattcactc 
tctgatagtc aggctgttgc caagcccagc 
atggattttt tctttactca agaatataga 
ccgcaagctt attcccttta gtga 



ggccctcgag gccaagaatt cggcacgagg 60 
ctccatatta ctgacaccca attcaagaaa 120 
ct ggggtctc ttccatctct actaacccct 180 
gacattgtac tttatgaaag cagcagttct 240 
ttagggagta tctggagaca ctgagggttg 300 
tccagtgagt ccagggatcc agggatgccg 360 
cacacataga acagagaaat tgctgagcca 42 0 
gatatacaca ctatcacaca gtatctgctc 480 
catctcatag gaaacagaaa tgtcatttag 540 
ctgagttttt tttttttttt taatcagcct 600 
ttctttctgt ctcattttca agtgattttt 660 
actggagatc cccacaaagc tgagattcac 720 
tacacttcca tttacatgca gacatccacc 780 
gaacgcaaac tgccccataa aagcacggtt 840 
cagggaggta tctatttgtt taattcagcc 900 
tctgaaactc ttcccctcta ggaaagaaag 960 
tctaaaaaaa aaaaaaaaaa aagttggcgg 1020 

1044 



<210> 15 
<211> 2577 
<212> DNA 

<213> Homo sapiens 
<400> 15 

ggtaccgggc cccccctcga ggtcgacggt atcgataagc ttgatatcga attcgcggcc 60 
gctgagaaat taactccccg gggccgccgg gttgactgcg ctgcctgggc cggaggtctt 120 
ctccggccag ggagcgctgt gggaaggggc tcgagcggcc agggccaggc gaggccgggg 18 0 
gggcgggggg ttaggggacc gcggggctac tcttgggagc gcccctgtcc ggctggctgc 240 
gcgccggttt taaatagcat ctttcggact tgtcttcgcg gccccagtcc ccgacctcgg 300 
cgctgcctgg gctcctgcag cctctcccta agtcttctcc aaacgaccac ctcacggatt 360 
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ccttacggat cgcagctcca agaggaggca ggtgaagcct ttggcagctt ctctgccgga 420 
agctcttgat catgatagtt cagacgacag tgattttaaa gtnggagatg cctcaggact 4 80 
cgctgattct tzgagaagagt caaaactgga gctctcaaaa aatggaccat attctgattt 540 
gcrgtgtttg tccgggagat aatagtgagg acgctgatga aataattcag tgtgacaatt 600 
gtggcattac agtccatgaa ggttgttatg gagttgatgg agagagtgac tctattatga 660 
gttcagcttc tgaaaactcc actgaacctt ggttttgtga tgcccgtaaa tgtggtgttt 720 
ctcctagctg tgaactgtgt cctaatcagg atggaattct caaggagaca gacgctgqaa 780 
gacgggttca tattgtttgt gccctgcatg ttcctggagt agcctttgga gatattgaca 840 
aattacgacc agtaacacta acggaaatga actattccaa atatggtgcc aaggagtgta 900 
gctttcgtga agaccctcgc tttgctagaa ctggggcttg cattagctgt gatgcaggga 960 
tgtgcagagc ctatttccat gtgacctgtg ctcaaaagga aggtctgctt tcagaggcag 1020 
cggcggaaga ggatatagca gatccattct tcgcttattg taagcaacat gcagataggt 1080 
tagacagaaa gtggaagaga aaaaactact tggctctaca gtcctattgt aaaatgtctt 1140 
tgcaagagag agagaagcaa ctatcaccag aagcacaggc aaggatcaat gcccggcttc 1200 
agcagtatcg tgccaaagca gaactagctc gatctaccag accccaggcc tgggttccaa 1260 
gggaaaaatt gcccagacca ctcaccagca gtgcttcagc tattcgtaaa cttatgcgga 1320 
aagcagaact catggggatc agtacagata tctt tccagt ggacaattca gatactagtt 1380 
ctagtgtgga tggaaggaga aaacataagc aaccagctct cactgcagat tttgtgaatt 1440 
attattttga gagaaatatg cgcatgattc aaattcagga aaatatggct gaacaaaaga 1500 
atataaaaga taaattagag aatgaacaag aaaagcttca tgtagaatat aataagctat 1560 
gtgaatcttt agaagaacta caaaacctga atggaaaact tcgaagtgaa ggacaaggaa 1620 
tatgggcttt actaggcaga atcacagggc agaagttgaa tataccggca attt tgcgag 1680 
cacccaagga gagaaaacca agtaaaaaag aaggaggcac acaaaagaca tctactcttc 1740 
ctgcagtact ttatagttgt gggatttgta agaagaacca tgatcagcat cttcttttat 1800 
tgtgtgatac ctgtaaacta cattaccatc ttggatgtct ggatcctcct cttacaagga 1860 
tgccaagaaa gaccaaaaac agttattggc agtgctcgga atgtgaccag gcagggagca 1920 
gtgacatgga agcagatatg gccatggaaa ccctaccaga tggaaccaaa cgatcaagga 1980 
ggcagattaa ggaaccagtg aaatttgttc cacaggatgt gccaccagaa cccaagaaga 2040 
ttccgataag aaacacgaga accagaggac gaaaacgaag cttcgttcct gaggaagaaa 2100 
aacatgagga aagagttcct agagagagaa gacaaagaca gtctgtgttg caaaagaagc 2160 
ccaaggctga agatttaaga actgaatgtg caacttgcaa gggaactgga gacaatgaaa 2220 
atcttgtcag gtgtgatgaa tgcagactct gctaccattt tggctgtttg gatcctcctt 2280 
tgaaaaagtc tcctaaacag acaggctacg gatggatatg tcaggaatgt gattcttcat 2340 
cttccaagga agatgaaaat gaagctgaaa gaaaaaatat atctcaggag ctcaacatgg 24 00 
aacagaaaaa tccaaagaaa taaaagattt tctgtagtgt ttttgaaaag tttgcagctt 2460 
atgtaatagc agataaaatt tctaattgta aaatgttaaa ttgagcggcc gcgaattcct 2520 
gcagcccggg ggatccacta gttctagagc ggccgccacc gcggtggagc tccagct 2577 



<210> 16 

<211> 2065 

<212> DNA 

<213> Homo sapiens 



<400> 16 

attaaccctc actaaaggga acaaaagctg 
actagtggat cccccgggct gcaggaattc 
9cgggaagct cgggccggca gggtttcccc 
gccgctgtaa gtttcgcttt ccattcagtg 
c 9cggggcca gaccaaggcg ggcccggagc 
cagtcccgac gtggaactca gcagcggagg 
catcgtgcct ggctcacata agcgcttcct 
gccaggcagc tgcggcctgg gggttttgga 
cctgaggcaa atctgtcagt ccatcctggc 
agaaaagaag gaagaagaca gcaacatgaa 
ctgggactac cctcatggcc tggt tggttt 
ctccttgatt caggtgttcg taatgaatgt 
ggtgcccagg ggagctgacg agcagaggag 
ggagaagatg caggacagcc ggcagaaagc 
gcagaagtgc aacgtgccct tgtttgtcca 
ctggaacctg attaaggacc agatcaccga 



gagctccacc gcggtggcgg ccgctctaga 60 
ggcacgaggt gcgcggctgc aacggcagcc 120 
gcacgctggc gcccagctcc cggcgcggag 180 
gaaaacgaaa gctgggcggg gtgccacgag 24 0 
ggaacttcgg tcccagctcg gtccccggct 300 
ctggacgctt gcatggcgct tgagagattc 360 
ggaagtgaag tcgtgctgtc ctgaacgcgg 420 
gtgatcacga atgagcaagg cgtctgggct 480 
tgagtcctcg cagtccccgg cagatcttga 54 0 
gagagagcag cccagagagc gtcccagggc 600 
acacaacatt ggacagacct gctgccttaa 660 
ggacttcacc aggatattga agaggatcac 720 
aagcgcccct ttccagatgc ttctgctgct 780 
agtgcggccc ctggagccgg cctactgcct 840 
acatgatgct gcccaactgt acctcaaact 900 
tgtgcacttg gtggagagac cgcaggccct 960 



# 
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gtatatgatc cgggtgaagg accccttgac ttgcgctgac tgtgccatgg agagcagcag 1020 
aaacagcagc atgctcaccc ccccaccctc tctttttgac gtggactcaa agcccctgaa 1080 
gacactggag gacgccctgc actgcttctt ccagcccagg gagctatcaa gcaaaagcaa 1140 
gtgcttctgt gagaactgtg ggaagaagac ccgtgggaaa caggtcttga agctgaccca 1200 
tttgccccag accctgacaa tccacctcac gcgattctcc atcaggaatt cacagacgag 1260 
aaagatctgc cactccctgt acttccccca gagcttggat ttcagccaga tccttccaat 1320 
gaagcgagag tcttgtgatg ctgaggagca gtctggaggg cagcatgagc tttttgctgt 1380 
gactgcgcac gtgggaatgg cagactccgg tcattactgt gtctacatcc ggaatgctgt 1440 
ggatggaaaa tggttctgct tcaatgactc caatatttgc ttggtgtcct gggaagacat 1500 
ccagtgtacc tacggaaatc ctaactacca ctggcaggaa actgcatacc ttctggttta 1560 
catgaagatg gagtgctaat ggaaatgccc aaaacctcca gagattgaca cgctgtcatt 1620 
ttccatttcc gttcctggat ctacggagtc ttctaagaga ttttgcaatg aggagaagca 1680 
ttgttttcaa actatataac tgagccttat ttataattag ggatattatc aaaatatgta 1740 
accatgaggc ccctcaggtc ctgatcagtc agaatggatg ctttcaccag cagacccggc 1800 
catgtggctg cccggtcctg ggtgctcgct gctgtgcaag acattagccc tttagttatg 1860 
agcctgtggg aacttcaggg gttcccagtg gggagagcag tggcagtggg aggcatctgg 1920 
gggccaaagg tcagtggcag ggggtatttc agtattatac aactgctgtg accagacttg 1980 
tatactggct gaatatcagt gctgtttgta atttttcact ttgagaacca acattaattc 2040 
catatgaaaa aaaaaaaaaa aaaaa 2065 



<210> 17 

<211> 1588 

<212> DNA 

<213> Homo sapiens 



<400> 17 

gcggccgctc tagaactagt ggatcccccg 
ctcccataag agtctacaca ccatagaact 
ccaaagttaa ctggaaatat tacaaactgc 
tcataacatg atgtatatca aaggaaaaaa 
tgtataaaat attcatttat gtgaagtttc 
ccgtctgatg tttgagatct gttgatttta 
tttgggaagg aatcaataaa atgatagggc 
catgctccct ggaagaacaa gaatgctgaa 
tcagggagag gtggagaaaa atacaaaact 
cttctccaga gggccactgg cttataggaa 
atggcacgcc aggacattaa gacactggag 
atggagtctc gctctcttga caggcaggag 
ctccgcctcc cgggttcaag tgattctcct 
ggcgtgtgcc accaccccca gctaactttt 
gttggccagg atggtctcaa tctcttgacc 
tgctgggatt acaggcgtga gccagtgtgc 
acagattact aggacctcat tatgtggtag 
tttttaaaag caatagctgt tgtatatcaa 
attgaagaaa gaatggttat catagagggt 
ggtggctcac gttctgtaat ccctgtactt 
gccaggagtt cgagaccagc ctgaccaaca 
aaaattagcc ggacatcgtg gtagatgcct 
aaaatcactt gaacccgggg gacggaggtt 
cagcctgggc aacagagtga gactctgcct 
agaatagcat gtgcacatat acacagacgt 
tttatttaca atgtatcaca agttttatgc 
aaaaaaaaaa aaaaagcggc cgcgaatt 



ggctgcagga attcgcggcc gctaaatgaa 60 
cataccagga atcacaaagt ctctaaattt 120 
agaataattc caggccaaaa tatgttaaat 180 
ggacatgtgg aaatgacaca ttatcttcag 240 
ttggaaaggc tacactacta ttactggttt 300 
tgcttttctt acaggccttt cattatgatc 360 
ctacttcatt aggtgtggtt cattcctatt 420 
ttttgaaatt taatattgta tgaattagca 480 
aaaagtcatg cctattgtgt tcagtgtgcc 540 
aggattgctg ctctaccagt tgaccaggag 600 
ttttgtttcg tttttttttt tttttttgag 660 
tacagtggtg cgatctcggc tcactgcaaa 72 0 
gcctcggcct cccgagtagc tgggactaca 780 
gtatttttag tagagacagg gtttcaccat 840 
tcatgatccg cccgcctccg ccttccaaag 900 
ccggccgaca ctgggctttt tatgagagtg 960 
aagtaatgta ggggaaatgg cgattatctt 1020 
tgataaatga aaaattagtt attcttgtaa 1080 
agttcaagta aaagaaccag ggctgggtgt 114 0 
tgggaggcca aggcagatgg atctcttgag 1200 
tggcaaaacc gtgtctctac aaaaaataca 1260 
gtagtctcag atattcagga gaccgagggg 1320 
gcagtgagct gagatcgcac cactgctcgc 1380 
caaaaaaaaa ccaaaccaaa ccaaagaacc 144 0 
ttcacaactg gcattatgtt ttgctactgt 1500 
tttaataaaa tttaatcata acttcaaaaa 1560 

1588 



<210> 18 
<211> 
<212> DNA 

<213> Homo sapiens 
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<400> 18 



1 gcggccgctc ttttacccag cggtcaaaac tggcacaaca caggatttat 
51 actggagagc aacctcacaa atgtaacgaa agtggtaagg ctttttatca 
aatgtcctcg tctttggggt catgagaaaa ttcatacaga atctattaaa 
aatgcactaa acatggaaag acctttaagc aacagtgatg tgatgaaggt 
ggtagttttt taaatgaact cccataagag tctacacacc atagaactca 
taccaggaat cacaaagtct ctaaatttcc aaagttaact ggaaatatta 
caaaccgcag aataattcca ggccaaaata tgctaaattc ataacatgat 
gtatatcaaa ggaaaaaagg acatgtggaa atgacacatt atcttcagtg 
cataaaatat tcatttatgt gaagtttctt ggaaaggcta cactactatt 
actggtttcc gtctgatgtt tgagatctgt tgattttatg cttttcttac 
aggcctttca ttatgatctt tgggaaggaa ccaataaaat gatagggcct 
acttcattag gtgtggttca ttcctattca tgctcgcggc cgctctagaa 
ctagtggacc ccccgggctg caggaattcg cggccgctaa atgaactccc 
ataagagtct acacaccata gaactcatac caggaatcac aaagtctcta 
aatttccaaa gttaactgga aatattacaa actgcagaat aattccaggc 
caaaatatgt taaattcata acatgatgta tatcaaagga aaaaaggaca 
tgtggaaatg acacattatc ttcagtgtat aaaatattca tttatgtgaa 
gtttcttgga aaggctacac tactattact ggtttccgtc tgatgtttga 
gatctgttga ttttatgctt ttcttacagg cctttcatta tgatctttgg 
gaaggaatca ataaaatgat agggcctact tcattaggtg tggttcattc 
ctattcatgc tccctggaag aacaagaatg ctgaattttg aaatttaata 
ttgtatgaat tagcatcagg gagaggtgga gaaaaataca aaactaaaag 
tcatgcttat tgtgttcagt gtgcccttct ccagagggcc actggcttat 
aggaaaggat tgctgctcta ccagttgacc aggagatggc acgccaggac 
attaagacac tggagttttg tttcgttttt tttttttttt ttgagatgga 
gtctcgctct cttgacaggc aggagtacag tggtgcgatc tcggctcact 
gcaaactccg cctcccgggt tcaagtgatt ctcctgcctc ggcctcccga 
gtagctggga ctacaggcgt gtgccaccac ccccagctaa cttttgtatt 
tttagtagag acagggtttc accatgttgg ccaggatggt ctcaatctct 
tgacctcatg atccgcccgc ctccgccttc caaagtgctg ggattacagg 
cgtgagccag tgtgcccggc cgacactggg ctttttatga gagtgacaga 
ttactaggac ctcattatgt ggtagaagta atgtagggga aatggcgatt 
atcttttttt aaaagcaata gctgttgtat atcaatgata aatgaaaaat 
tagttattct tgtaaattga agaaagaatg gttatcatag agggtagttc 
aagtaaaaga accagggctg ggtgtggtgg ctcacgttct gtaatccctg 
tactttggga ggccaaggca gatggatctc ttgaggccag gagttcgaga 
ccagcctgac caacatggca aaaccgtgtc tctacaaaaa atacaaaaat 
tagccggaca tcgtggtaga tgcctgtagt ctcagatatt caggagaccg 
aggggaaaat cacttgaacc cgggggacgg aggttgcagt gagctgagat 
cgcaccactg ctcgccagcc tgggcaacag agtgagactc tgcctcaaaa 
aaaaaccaaa ccaaaccaaa gaaccagaat agcatgtgca catatacaca 
gacgtttcac aactggcatt atgttttgct actgttttat ttacaatgta 
2101 tcacaagttt tatgctttaa taaaatttaa tcataacttc aaaaaaaaaa 
aaaaaaaaaa gcggccgcga att 



101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 



210> 19 
211> 

212> DNA 

213> Homo sapiens IMX 32 
400> 19 

210> 20 
211> 2410 
212> DNA 

213> Homo sapiens 
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gaattcggca cgaggaaaac attcgcccct tgcagaagat cacccctagt tctccctcgg 60 
aagagcacca gaaggcctgg aacctcttta accgcacgct gcctttctac ttcgttcaga 120 
agantgagcg agcacagaac ccggccctcc gggaagtcta ccagtggtgc gttggggctc 180 
gctcttggcg ggctggtgac tctgtccctt cacaccactg gctggttgcc acacgtggcc 240 
cgggtttcca ggaaaagcag agcggcagtt agggccgcca tgtgctggga gctgtgtgtc 300 
tgctctcctt cgtccgcccc cccagggcag tgtggtagca catcccattg tagagatgag 360 
ggcaccgagg cttcctggag cataccacct ggtcccgttc atgagtggtg gcaaagctag 420 
cactctcact tgtccattct gccttcctgg agaccagtgg gatgggtcag tacagcccac 480 
cacaccatta gccccaggaa cataaggctg tggctagaca gcaggggtct caggttcata 540 
catgaggact ggcttgtcct tgagcaccca ctcacctgtc tatgtgggga ggaaccctac 600 
aataggtcac catggcaggc tgggtcttgc tgacctgtcc ccagatgggg ttggggtagt 660 
gtaatgtgta ctctgtgcac agtgatgaag tctgggaatg ggagagggga gaaggatggg 720 
cacccactga ccagcagcct gaaaattcct acagcatccc agggctcagc tccatgcagg 780 
agcaaggtgg gggtggggtt gggggaaatg ctacccattt tccaagggct gctctgcttt 840 
tggagtccag ggaaccgctg ctgtctggag ctgtggaggg agggttttca cccagctccc 900 
acgatccccc ttcttttcca caccctggct tgtggctgga gccttacagg cctagtcagg 960 
gtagcctgtg acctgcgtct cttggtccca ggacactttt ggaattttgg aaaaatgtgt 1020 
tgttttgcat caggccggct gtatttggtg gccggcacac tctgccccca gcacacattc 1080 
ttctgtgatt ctaggcaaaa aggacagatg cagaagcaga acggagggaa ggccgtggac 114 0 
gagcggcagc tgttccacgg caccagcgcc atttttgtgg acgccatctg ccaacagaac 1200 
tttgactggc gggtctgtgg tgttcatggc acttcctacg gcaaggggag ctactttgcc 1260 
cgagatgctg catattccca ccactacagc aaatccgaca cgcagaccca cacgatgttc 1320 
ctggcccggg tgctggtggg cgagttcgtc aggggcaatg cctcctctgt ccgcccgccg 13 80 
gccaaggagg gctggagcaa cgccttctat gatagctgcg tgaacagtgt gtccgacccc 1440 
tccatctttg tgatctttga gaaacaccag gtctacccag agtatgtcat ccagtacacc 1500 
acctcctcca agccctcggt cacaccctcc atcctgctgg ccttgggctc cctgctcagc 1560 
agccgacagt gagcgcacag gagtgttcca ggcctttcac ctgctctgcc ttgaaatggc 1620 
tatttgggcc tttccttttc tttttaaaca gaaactttta atgaactgtt ctcttaacat 1680 
tgacctctca atgaagttat gttcttaatc tcttgctaat aatgattttt acttttaagt 1740 
cacttttggg ttcactagtg gattaaccag aagtgattgt agttgagtcc agttttgctt 1800 
tttaataatg tgttgaagtt ttagttttta ctctttgttg actttgctgc ttattggcac 1860 
cagggacaga gtttctagat acaattttat ggattggttt taatttttat gagtttgtct 1920 
ctgcagtgat tcggtttctc agagtctcat ggcatcatag tttttccaga atgacacagt 1980 
agccaccggt ggatgacagc ccacgggcgg cacagtcact tctgcctgtt gctctgacac 2 040 
caacccaggc agctctgctg tggcttctcc tgggctctgg cattagttgg tctgtgtcac 2100 
attgtcagaa caggtggctg ctgtgtggtg ccatcgagtc cctgctggtt ccccttgtcc 2160 
tgggagggtc acccattgcc caaggaagtg catccacctg gcaggtgacc tggaggagta 2220 
gcttccccga ggacccccag gcttggcctg tgattgcgca aacccacatt tcctaagcac 2280 
actggacacc cttcgagtgt gggttttaac atccctgtga gattgaatac ttgtgccaca 2340 
catgtcacaa aagagtatgg aaataaaaga aaatttatcc gaaaaaaaaa aaaaaaaaat 2400 
gagcggccgc 241Q 



<210> 21 
<211> 
<212> DNA 

<213> Homo sapiens IMX 40 



<400> 21 






1 


ttgcagagct 


cgtttagtga 


51 


tagggcggcc 


gcgaattcgg 


101 


acctgggttt 


cgacttgtga 


151 


gagtcatagg 


cgatgagagg 


201 


caactcctga 


gaaggacaca 


251 


gcctacctca 


ttgggctcgt 


301 


cgcggcacga 


gccagaggct 


351 


ttggagagac 


ccagaaaaga 


401 


ctcaggcaat 


gggggtggct 


451 


ccacttcttc 


cacctgccct 


501 


tcagaacaga 


gctgagtgca 


551 


aacgatgaag 


aatttcaaga 



8 
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601 


aaaatctarr 

w» a \^ v-- i^- y w 


u uuyycccaa 


651 


aacttocaot 




701 




*~ u. LyyydSa t. 


751 


taacacaaat 

%A ■ ^ * V» LA W 


t~ ^ ^ -j i-t f- r-ro f r~r 


801 


aoaaaacaac 

^* ™^ ^* * ■* 


k-^eiaaLcidLct 


851 


ccaaaatcaa 


y ty LLctLLaa 


901 


ay^- waen, y v_ ci 


LLLdCyCtyC 


951 


caattttotr 


y^.aLyyday d 


1001 


atcccf r cjc3 r* 

^ » ^ L \m*l V— 


f t~ f f~ a ^ ^ 


1051 


t QQacar3ar 

^- w c* y 


ay day i_ d i_yct 


1101 

-A- W d^r 


iu> i. w y y y y o. 


gaagacccgg 


1151 


aaoaaaaafr 


aayava^dLL 


1201 


»■* ^3 ^3 c» i_ Lp y u V— ■ 




1251 


tQQacacato 

™^ V* u U t^* 


aayaua^aay 


1301 


cct tacatoa 


o y *— o. L. y i_ a. a. 


1351 


atggtggtaa 


aggtattttt 


1401 


caggttaccc 


accatactac 


1451 


t taagaacaa 


agattgaggt 


1501 


ctacaacata 


gaaaccctat 


1551 


tcaaactcag 


gactccggtc 


1601 


cctgccagcc 


tgccctatgg 


1651 


aaaataaatc 


tctccgtct 


<210> 22 







<211> 
<212> DNA 

<213> Homo sapiens, IMX42 



tgtgtaaaca ctngctgagg aacttggaac 
atcgcgaaac cagcaacctg ttcacaattc 
tcgaagtgga gtacccgcac caacacgaaa 
ctcaacaacg aaaagcaaaa agaaccaaag 
tctggatgta tctaaaaata tacaatgcct 
tgaaaattct gaagaaatgc caccggacat 
tgttactcat ttcctgcttt tcacagaaaa 
ccgtgaactt caaagggcag aggggaaact 
ccaaggcccc actttttcat caacactccc 
acataatatg gtcctgaatg aggctgagtc 
gttaataaaa ataggaaggt aagaaaagaa 
ataggactaa attcctatta tttatccact 
ggccagatag ttgtaccccc atgtaccagg 
aagtgctgtg atggttcatt ttgcacgtca 
ctcattgttt ggtcaaacac tagtctggac 
tagatgagat taacgcttaa atcagtaaag 
gggtgggccc tgtccaatca gttgaaggca 
ttcctaaaga agatggaatt ctccttgaga 
ctgagtttcc agcctgttgc cctgtggaat 
tatggcatta accctcactt aacttttcag 
atttcggact tgccagccac acaattcctt 



<400> 22 



<210> 23 
<211> 1025 
<212> DNA 

<213> Homo sapiens 
<400> 23 

atggatagtc gccacacctt tgcccctgct gcgatgaccc tgtcgccact tctgctgttc 60 
ctgccaccgc tgctgctgct gctggacgtc cccacggcgg cggtgcaggc gtcccctctg 120 
caagcgttag acttctttgg gaatgggcca ccagttaact acaagacagg caatctatac 180 
ctgcgggggc ccctgaagaa gtccaatgca ccgcttgtca atgtgaccct ctactatgaa 240 
gcactgtgcg gtggctgccg agccttcctg atccgggagc tcttcccaac atggctgttg 300 
gtcatggaga tcctcaatgt cacgctggtg ccctacggaa acgcacagga acaaaatgtc 360 
agtggcaggt gggagttcaa gtgccagcat ggagaagagg agtgcaaatt caacaaggtg 420 
gaggcctgcg tgttggatga acttgacatg gagctagcct tcctgaccat tgtctgcatg 480 
gaagagtttg aggacatgga gagaagtctg ccactatgcc tgcagctcta cgccccaggg 540 
ctgtcgccag acactatcat ggagtgtgca atgggggacc ccggcatgca gctcatgcac 600 
gccaacgccc agcggacaga tgctctccag ccaccacacg agtatgtgcc ctgggtcacc 660 
gtcaatggga aacccttgga agatcagacc cagctcctta cccttgtctg ccagttgtac 720 
cagggcaaga agccggatgt ctgcccttcc tcaaccagct ccctcaggag tgtttgcttc 780 
aagtgatggc cggtgagctg cggagagctc atggaaggcg agtgggaacc cggctgcctg 840 
cctttttttt ctgatccaga ccctcggcac ctgctactta ccaactggaa aattttatgc 900 
atcccatgaa gcccagatac acaaaattcc accccatgat caagaatcct gctccactaa 960 
gaacggtgct aaagtaaaac tagtttaata agcaaaaaaa aaaaaaaaaa aattcctgcg 1020 
gCC 9 c 1025 



<210> 
<211> 
<212> 



24 

1039 
DNA 



0 



0 
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<213> Homo sapiens 
<400> 24 

gaattcggca cgagggctgc agtcgccaca cctttgcccc tgctgcgatg accctgtcgc 60 
cactcctgct gtccctgcca ccgctgccgc tgctgccgga cgtccccacg gcggcggtqc 120 
aggcgtcccc cctgcaagcg ttagacttct ttgggaatgg gccaccagct aactacaaga 180 
caggcaacct acacctgcgg gggcccctga agaagcccaa Cgcaccgcct gtcaatqtqa 240 
ccctctacta tgaagcactg tgcggtggct gccgagcctt cccgacccgg gagctcttcc 300 
caacatggct gccggtcatg gagatcctca atgtcacgcc ggtgccctac ggaaacgcac 360 
aggaacaaaa tgtcagtggc aggtgggagt tcaagtgcca gcatggagaa gaggagtqca 420 
aattcaacaa ggtggaggcc cgcgtgttgg atgaacttga catggagcta gccttcctga 480 
ccattgcctg catggaagag tttgaggaca tggagagaag cctgccacta tgcctgcaqc 540 
tctacgcccc agggctgtcg ccagacacta tcatggagtg tgcaacgggg gaccccggca 600 
tgcagctcat gcacgccaac gcccagcgga cagatgctct ccagccacca cacgagtatq 660 
tgccctgggt caccgtcaat gggaaaccct tggaagatca gacccagctc cttacccttq 720 
cctgccagtt gcaccagggc aagaagccgg atgtctgccc ttcctcaacc agctccctca 780 
ggagtgtttg ccccaagtga tggccggtga gctgcggaga gctcatggaa ggcgagtgqq 840 
Soaaa^rr? C " gCCtCtt "ttctgatc cagaccctcg gcacctgcta cttlccaac? 900 
ggaaaatttt atgcatccca tgaagcccag atacacaaaa Ctccacccca tgatcaagaa 960 

«»C»rr " aagaat " «=9««..g t . aaactagttt aataagcaaa aaaaaaaLa 1020 

aaaaaattcc tgcggccgc 



1039 



<210> 25 
<211> 466 
<212> DNA 

<213> Homo sapiens 
<400> 25 



atggatagtc gccacacctt tgcccctgct gcgatgaccc tgtcgccact tctgctgttc 60 
ctgccaccgc tgctgctgct gctggacgtc cccacggcgg cggtgcaggc gtcccctctg 120 
n^ 9 " a9 aCttcttt 99 gaatgggcca ccagttaact acaagacagg caatctatac 180 
ctgcgggggc ccctgaagaa gtccaatgca ccgcttgtca atgtgaccct ctactatgaa 240 
gcactgtgcg gtggctgccg agccttcctg atccgggagc tcttcccaac atggctgttg 300 
gtcatggaga tcctcaatgt cacgctggtg ccctacggaa acgcacagga acaaaatgtc 360 
agtggcaggt gggagttcaa gtgccagcat ggagaagagg agtgcaaatt caacaagqtq 420 
gaggcctgcg tgttggatga acttgacatg gagctagcct tcctga 466 



<210> 26 
<211> 
<212> DNA 

<213> Homo sapiens 
<400> 26 



<210> 27 
<211> 32 
<212> PRT 

<213> Homo sapiens 
<400> 27 

Met His Trp Glu Glu Ala Gin He Ser Arg Ala Val Leu Ser Leu Pro 

5 10 15 

Arg He Asp Leu Cys Val Ser Pro Asn Lys Leu Thr Tyr Ser Pro Lvs 

20 25 30 

<210> 28 
<211> 98 



10 
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<212> PRT 

<213> Homo sapiens 
<400> 28 

Met Glu Phe Asn Thr Thr His Tyr Arg Glu Phe Gly Pro Arg Gly Gin 
15 io is 

Glu Phe Gly Thr Arg Gin Gin Gin Gin Gin Lys Lys Thr Glu His Leu 

20 25 30 

His lie Thr Asp Thr Gin Phe Lys Lys Gin Asn He Thr Ala Pro Ser 

35 40 45 

Arg lie Phe Leu Gly Ser Leu Pro Ser Leu Leu Thr Pro Asp Tyr Lys 
50 55 60 

Gin Pro Pro Pro He Ser Pro Asp He Val Leu Tyr Glu Ser Ser Ser 
65 70 75 80 

Ser Gin Met Gly Leu Phe Cys Pro Leu Gly Thr Leu Gly Ser He Trp 

85 90 95 

Arg His 



<210> 29 
<211> 663 
<212> PRT 

<213> Homo sapiens 
<400> 29 

Met He Val Gin Met Thr Val He Leu Lys Leu Glu Met Pro Gin Asp 
1 5 10 15 

Ser Leu He Leu Glu Lys Ser Gin Asn Trp Ser Ser Gin Lys Met Asp 

20 25 30 

His He Leu He Cys Cys Val Cys Leu Gly Asp Asn Ser Glu Asp Ala 

35 40 45 

Asp Glu He He Gin Cys Asp Asn Cys Gly lie Thr Val His Glu Gly 
50 55 60 

Cys Tyr Gly Val Asp Gly Glu Ser Asp Ser lie Met Ser Ser Ala Ser 
65 7 0 75 80 

Glu Asn Ser Thr Glu Pro Trp Phe Cys Asp Ala Cys Lys Cys Gly Val 

85 90 95 

Ser Pro Ser Cys Glu Leu Cys Pro Asn Gin Asp Gly He Phe Lys Glu 

100 105 no 

Thr Asp Ala Gly Arg Trp Val His lie Val Cys Ala Leu Tyr Val Pro 
115 120 125 

Gly Val Ala Phe Gly Asp He Asp Lys Leu Arg Pro Val Thr Leu Thr 
130 135 140 

Glu Met Asn Tyr Ser Lys Tyr Gly Ala Lys Glu Cys Ser Phe Cys Glu 
145 150 155 160 



11 
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Asp Pro Arg Phe Ala Arg Thr Gly Val Cys lie Ser Cys Asp Ala Gly 

165 170 175 

5 Met Cys Arg Ala Tyr Phe His Val Thr Cys Ala Gin Lys Glu Gly Leu 

180 185 190 



10 



Leu Ser Glu Ala Ala Ala Glu Glu Asp lie Ala Asp Pro Phe Phe Ala 
195 200 205 

Tyr Cys Lys Gin His Ala Asp Arg Leu Asp Arg Lys Trp Lys Arg Lys 
210 215 220 



15 225 



Asn Tyr Leu Ala Leu Gin Ser Tyr Cys Lys Met Ser Leu Gin Glu Arg 

230 235 240 



20 



25 



Glu Lys Gin Leu Ser Pro Glu Ala Gin Ala Arg He Asn Ala Arg Leu 

245 250 255 

Gin Gin Tyr Arg Ala Lys Ala Glu Leu Ala Arg Ser Thr Arg Pro Gin 

260 265 270 

Ala Trp Val Pro Arg Glu Lys Leu Pro Arg Pro Leu Thr Ser Ser Ala 
275 280 285 

Ser Ala He Arg Lys Leu Met Arg Lys Ala Glu Leu Met Gly He Ser 
290 295 300 

™ T«5 ASP 116 Phe Pro Val As P Asn Ser As P Thr ser Ser Ser Val Asp 
30 305 310 315 32 £ 

Gly Arg Arg Lys His Lys Gin Pro Ala Leu Thr Ala Asp Phe Val Asn 

325 330 335 

35 Tyr Tyr Phe Glu Arg Asn Met Arg Met He Gin He Gin Glu Asn Met 

340 345 350 

Ala Glu Gin Lys Asn He Lys Asp Lys Leu Glu Asn Glu Gin Glu Lys 
355 360 , 6S 

40 Je>i> 

Leu His Val Glu Tyr Asn Lys Leu Cys Glu Ser Leu Glu Glu Leu Gin 
370 375 380 

Asn Leu Asn Gly Lys Leu Arg Ser Glu Gly Gin Gly He Trp Ala Leu 

41 385 390 395 400 

Leu Gly Arg He Thr Gly Gin Lys Leu Asn He Pro Ala He Leu Arg 

4 °5 410 415 



50 



Ala Pro Lys Glu Arg Lys Pro Ser Lys Lys Glu Gly Gly Thr Gin Lv 

420 425 43o 



Thr Ser Thr Leu Pro Ala Val Leu Tyr Ser Cys Gly He Cys Lys Lys 

55 435 44 0 445 

Asn His Asp Gin His Leu Leu Leu Leu Cys Asp Thr Cys Lys Leu His 
450 455 460 

60 111 ASP Pr ° Pr ° LeU THr Arg Met Pro Ar 9 L Y S 

470 475 480 
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Thr Lys Asn Ser Tyr Trp Gin Cys Ser Glu Cys Asp Gin Ala Gly Ser 

485 490 495 

Ser Asp Met Glu Ala Asp Met Ala Met Glu Thr Leu Pro Asp Gly Thr 

500 505 sio 

Lys Arg Ser Arg Arg Gin lie Lys Glu Pro Val Lys Phe Val Pro Gin 
515 520 525 

Asp Val Pro Pro Glu Pro Lys Lys He Pro He Arg Asn Thr Arg Thr 
530 535 540 



Arg Gly Arg Lys Arg Ser Phe Val Pro Glu Glu Glu Lys His Glu Glu 
545 550 555 560 

Arg Val Pro Arg Glu Arg Arg Gin Arg Gin Ser Val Leu Gin Lys Lys 

565 570 575 

Pro Lys Ala Glu Asp Leu Arg Thr Glu Cys Ala Thr Cys Lys Gly Thr 

580 585 590 

Gly Asp Asn Glu Asn Leu Val Arg Cys Asp Glu Cys Arg Leu Cys Tyr 
595 600 605 

His Phe Gly Cys Leu Asp Pro Pro Leu Lys Lys Ser Pro Lys Gin Thr 
610 615 620 

Gly Tyr Gly Trp He Cys Gin Glu Cys Asp Ser Ser Ser Ser Lys Glu 
625 630 635 640 

Asp Glu Asn Glu Ala Glu Arg Lys Asn He Ser Gin Glu Leu Asn Met 

645 650 655 

Glu Gin Lys Asn Pro Lys Lys 

660 



<210> 30 

<211> 372 

<212> PRT 

<213> Homo sapiens 

<400> 30 

Met Ser Lys Ala Phe 
1 5 

Ala Glu Ser Ser Gin 

20 

Asp Ser Asn Met Lys 

35 

Asp Tyr Pro His Gly 
50 

Cys Leu Asn Ser Leu 
65 

Arg He Leu Lys Arg 

85 



Gly Leu Leu Arg Gin He 

10 

Ser Pro Ala Asp Leu Glu 

25 

Arg Glu Gin Pro Arg Glu 

40 

Leu Val Gly Leu His Asn 
55 

He Gin Val Phe Val Met 
70 - 75 

He Thr Val Pro Arg Gly 

90 



Cys Gin Ser lie Leu 

15 

Glu Lys Lys Glu Glu 

30 

Arg Pro Arg Ala Trp 
45 

lie Gly Gin Thr Cys 
60 

Asn Val Asp Phe Thr 

80 

Ala Asp Glu Gin Arg 

95 
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Arg Ser Val Pro Phe Gin Met Leu Leu Leu Leu Glu Ly 



100 



105 



s Met Gin Asp 
110 



Ser Arg. Gin Lys Ala Val Arg Pro Leu Glu Leu Ala Tyr Cys Leu Gin 
115 120 125 

Lys Cys Asn Val Pro Leu Phe Val Gin His Asp Ala Ala Gin Leu Tyr 



130 



135 



140 



Leu Lys Leu Trp Asn Leu lie Lys Asp Gin He Thr Asp Val His Leu 
145 15 <> 155 160 

Val Glu Arg Leu Gin Ala Leu Tyr Met He Arg Val Lys Asp Ser Leu 

l fi 5 170 175 

He Cys Val Asp Cys Ala Met Glu Ser Ser Arg Asn Ser Ser Met Leu 

180 185 190 

Thr Leu Pro Leu Ser Leu Phe Asp Val Asp Ser Lys Pro Leu Lys Thr 
195 200 205 

Leu Glu Asp Ala Leu His Cys Phe Phe Gin Pro Arg Glu Leu Ser Ser 
210 215 220 

Lys Ser Lys Cys Phe Cys Glu Asn Cys Gly Lys Lys Thr Arg Gly Lys 
225 230 235 240 

Gin Val Leu Lys Leu Thr His Leu Pro Gin Thr Leu Thr He His Leu 

245 250 255 

Met Arg Phe Ser He Arg Asn Ser Gin Thr Arg Lys He Cys His Ser 

260 265 270 

Leu Tyr Phe Pro Gin Ser Leu Asp Phe Ser Gin He Leu Pro Met Lys 
275 280 285 

Arg Glu Ser Cys Asp Ala Glu Glu Gin Ser Gly Gly Gin Tyr Glu Leu 
290 295 300 

Phe Ala Val He Ala His Val Gly Met Ala Asp Ser Gly His Tyr Cys 
305 310 315 320 

Val Tyr He Arg Asn Ala Val Asp Gly Lys Trp Phe Cys Phe Asn Asp 

325 330 335 

Ser Asn He Cys Leu Val Ser Trp Glu Asp He Gin Cys Thr Tyr Gly 

340 345 350 

Asn Pro Asn Tyr His Trp Gin Glu Thr Ala Tyr Leu Leu Val Tyr Met 



355 



360 



365 



Lys Met Glu Cys 
370 



<210> 31 
<211> 71 
<212> PRT 

<213> Homo sapiens 
<400> 31 



14 



Met Ala Ala Ala Leu Leu Pro Ser Gly 
1 5 

Phe lie Leu Glu Ser Asn Leu Thr Asn 

20 25 

Phe lie Lys Cys Pro Cys Leu Trp Gly 

35 40 

Ser lie Lys Asn Val Leu Asn Met Glu 
50 55 

Val Met Lys Val Val Val Phe 
65 70 



PCT/US99/26788 

Gin Asn Trp His Asn Thr Gly 
10 is 

Val Met Lys Val Val Arg Leu 

30 

His Glu Lys lie His Thr Glu 

45 

Arg Pro Leu Ser Asn Ser Asp 

60 



<210> 32 
<211> 
<212> PRT 

< 2 1 3 > Homo s ap i ens 
<400> 32 



<210> 33 
<211> 302 
<212> PRT 

<213> Homo sapiens 



<400> 33 



Met Cys Thr Leu Cys Thr Val Met Lys Ser Gly Asn Gly Arg Gly Glu 
15 10 15 

Lys Asp Gly His Pro Leu Thr Ser Ser Leu Lys lie Pro Thr Ala Ser 



15 
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20 25 30 

Gin Gly Ser Ala Pro Cys Arg Ser Lys Val Gly Val Gly Leu Gly Glu 

35 40 45 

Met Leu Pro He Phe Gin Gly Leu Leu Cys Phe Trp Ser Pro Gly Asn 
50 55 60 

Arg Cys Cys Leu Glu Leu Trp Arg Glu Gly Phe His Pro Ala Pro Thr 
65 7 <> 75 eo 

He Pro Leu Leu Phe His Thr Leu Ala Cys Gly Trp Ser Leu Thr Gly 

85 90 95 

Leu Val Arg Val Ala Cys Asp Leu Arg Leu Leu Val Pro Gly His Phe 

100 105 no 

Trp Asn Phe Gly Lys Met Cys Cys Phe Ala Ser Gly Arg Leu Tyr Leu 
115 .120 125 

Val Ala Gly Thr Leu Cys Pro Gin His Thr Phe Phe Cys Asp Ser Arg 
130 135 i 4 o 

Gin Lys Gly Gin Met Gin Lys Gin Asn Gly Gly Lys Ala Val Asp Glu 
145 "0 155 160 

Arg Gin Leu Phe His Gly Thr Ser Ala He Phe Val Asp Ala He Cys 

165 170 175 

Gin Gin Asn Phe Asp Trp Arg Val Cys Gly Val His Gly Thr Ser Tyr 

I 80 185 190 

Gly Lys Gly Ser Tyr Phe Ala Arg Asp Ala Ala Tyr Ser His His Tyr 
195 200 205 

Ser Lys Ser Asp Thr Gin Thr His Thr Met Phe Leu Ala Arg Val Leu 
210 215 220 

Val Gly Glu Phe Val Arg Gly Asn Ala Ser Phe Val Arg Pro Pro Ala 
225 2 30 235 240 

Lys Glu Gly Trp Ser Asn Ala Phe Tyr Asp Ser Cys Val Asn Ser Val 

245 250 255 

Ser Asp Pro Ser He Phe Val He Phe Glu Lys His Gin Val Tyr Pro 

260 265 270 

Glu Tyr Val He Gin Tyr Thr Thr Ser Ser Lys Pro Ser Val Thr Pro 
275 280 285 

Ser He Leu Leu Ala Leu Gly Ser Leu Phe Ser Ser Arg Gin 
290 295 300 



<210> 34 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 34 

Met Pro Val Tyr Gly He Asn Pro His Leu Thr Phe Gin Pro Ala S 



er 
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10 



15 



Leu Pro Tyr Gly Phe Arg Thr Cys Gin Pro His Asn Ser Leu Lys 

20 25 



30 



<210> 35 
<211> 95 
<212> PRT 

<213> Homo sapiens 
<400> 35 

Met Leu lie Glu Asp 
1 5 

Leu Glu Pro He Cys 

20 

Val Ala Leu Val Lys 

35 

Cys Ala Asp Gin Leu 
50 

Val Asp Lys Leu Phe 
65 

Leu Glu Pro Val Lys 

85 



Val Asp Ala Leu Lys 

10 

Asp Ala Asp Pro Ser 

25 

Lys Asp Lys Pro Glu 

40 

Asp Val Phe Leu Gin 
55 

Glu Ser Leu Tyr Thr 
70 

Pro Glu Pro Lys Pro 

90 



Ser Trp Leu Ala Lys Leu 

15 

Ala Leu Ala Asn Tyr Val 

30 

Lys Glu Leu Lys Ala Phe 

45 

Lys Glu Thr Ser Gly Phe 
60 

Lys Asn Tyr Leu Pro Leu 
75 80 

Leu Ala Gin Glu Lys 

95 



<210> 36 
<211> 
<212> PRT 

<213> Homo sapiens 
<400> 36 
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<210> 37 
<211> 261 
<212> PRT 

<213> Homo sapiens 
<400> 37 

Met Asp Ser Arg His Thr Phe Ala Pro Ala Ala Met Thr Leu Ser Pro 

Leu Leu Leu Phe Leu Pro Pro Leu Leu Leu Leu Leu Asp val Pro Thr 

20 25 30 

Ala Ala Val Gin Ala Ser Pro Leu Gin Ala Leu Asp Phe Phe Gly Asn 

35 40 45 

Gly Pro Pro Val Asn Tyr Lys Thr Gly Asn Leu Tyr Leu Arg Gly Pro 
50 55 60 

Leu Lys Lys Ser Asn Ala Pro Leu Val Asn Val Thr Leu Tyr Tyr Glu 
65 70 75 80 

Ala Leu Cys Gly Gly Cys Arg Ala Phe Leu He Arg Glu Leu Phe Pro 

85 90 95 

Thr Trp Leu Leu Val Met Glu He Leu Asn Val Thr Leu Val Pro Tyr 

100 105 no 

Gly Asn Ala Gin Glu Gin Asn Val Ser Gly Arg Trp Glu Phe Lys Cys 
115 120 125 

Gin His Gly Glu Glu Glu Cys Lys Phe Asn Lys Val Glu Ala Cys Val 
i30 135 140 

Leu Asp Glu Leu Asp Met Glu Leu Ala Phe Leu Thr He Val Cys Met 
145 150 155 160 

Glu Glu Phe Glu Asp Met Glu Arg Ser Leu Pro Leu Cys Leu Gin Leu 

165 170 175 

Tyr Ala Pro Gly Leu Ser Pro Asp Thr He Met Glu Cys Ala Met Gly 

180 185 190 

Asp Pro Gly Met Gin Leu Met His Ala Asn Ala Gin Arg Thr Asp Ala 
195 200 205 

Leu Gin Pro Pro His Glu Tyr Val Pro Trp Val Thr Val Asn Gly Lys 
210 215 220 

Pro Leu Glu Asp Gin Thr Gin Leu Leu Thr Leu Val Cys Gin Leu Tyr 

225 o o-sc 

235 240 

Gin Gly Lys Lys Pro Asp Val Cys Pro Ser Ser Thr Ser Ser Leu Arg 

245 250 255 

Ser Val Cys Phe Lys 

260 



<2X0> 38 
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<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 38 
Met 
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t Pro Gly Tyr Arg His Cys Thr Pro Ala Trp Val Thr Glu Arg Asp 
15 10 15 



Ser Val Ser Glu Lys 
10 20 



19 




531 Rec' 




09/831621 
BT/7T 1 0 MAY 2001 



Attorney Docket: 98,664- A 



NOVEL DNAs AND POLYPEPTIDES 



Reference to Related Application 

This application claims the benefit of U.S. Provisional Application S.N. 60/107,821, 
filed November 10, 1998, which is incorporated herein by reference. 



Field of the Invention 

The invention is directed to purified and isolated novel polypeptide molecules and 
fragments thereof, the nucleic acid molecules encoding such polypeptides, processes for 
production of recombinant forms of such polypeptide molecules, antibodies generated 
against such polypeptide molecules, fragmented peptides derived from these polypeptide 
molecules, and uses thereof. In particular, the invention is directed to the use of the nucleic 
acid and polypeptide molecules in the study of inflammatory bowel diseases (IBD) 

Description of Related Art 

Damage to the intestinal epithelial barrier is a hallmark of inflammatory 
bowel diseases (IBD). Examples of inflammatory bowel diseases include ileitis, Crohn's 
disease (CD), which can affect the whole digestive tract from mouth to anus, and ulcerative 
colitis (UC), which affects only the large intestine. Studying the factors that influence the 
integrity of the epithelial barrier in vivo is a difficult task for a number of reasons that 
include the complexity of the tissue itself (there are numerous cell types in the gut including 
epithelial, stromal, endocrine, neuronal and hematopoietic) and the technical problems 
associated with tissue manipulation in animals or in isolated organs. As a result of these 
issues, a number of in vitro models of epithelial barrier function have been developed over 
the years. The best characterized of these models is the T84 intestinal epithelial barrier 
system (Dharmsathaphorn et al., Am. J. Physiol. , 246:G204-G208, 1984 and Madara et al., 
J. Cell BioL , 101:2124-2133, 1985). T84 cells, while derived from a human colonic 



adenocarcinoma, have retained many of the properties associated with normal colonic crypt 
cells. T84 cells form polarized monolayers that exhibit high electrical resistance and 
vectorial fluid and chloride secretion reminiscent of colonic crypts in vivo. These properties 
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are directly dependent on a complex of proteins referred to as tight junctions and TS4 cells 
express many of the known members of this complex (Youakim et al. t Submitted for 
publication, 1998). These cells also respond to proinflammatory cytokines, such as 
interieron-gamma (INF-gamma), by decreasing barrier function (Youakim et al.. Submitted 
5 for publication , 1998; Madara et ah, J. Clin. Invest., 83 :724-727, 1989; an d Adams et al., J. 
Immunol, 150:2356-2363, 1993) and by up-regulating MHC Class II molecules and antigen 
presenting activity (Hershberg et al., J. Clin. Invest., 102:792-803, 1998). This model 
system is used to examine how the epithelial barrier is regulated by various agents such as 
interferon-gamma and cells of the immune system (e.g., neutrophils). The T84 model 

10 assists in the elucidation of the mechanism of barrier breakdown and recovery in response to 
these agents and in the identification of proteins (and genes) that may prevent barrier 
breakdown or stimulate barrier recovery. 

Given the important function of the epithelial gut barrier and despite the growing 
body of knowledge, there is a need in the art for the discovery, the identification, and the 

15 elucidation of the roles of new proteins involved in gut barrier function and EBD. 

The identification of the primary structure, or sequence, of an unknown protein is the 
culmination of an arduous process of experimentation. In order to identify an unknown 
protein, the investigator can rely upon a comparison of the unknown protein to known 
peptides using a variety of techniques known to those skilled in the art. For instance. 

20 proteins are routinely analyzed using techniques such as electrophoresis, sedimentation, 
chromatography, sequencing and mass spectrometry. 

In particular, comparison of an unknown protein to polypeptides of known molecular 
weight allows a determination of the apparent molecular weight of the unknown protein 
(T.D. Brock and M.T. Madigan, Biology of Microorganisms 76-77 (Prentice Hall, 6d ed. 

25 1991)). Protein molecular weight standards are commercially available to assist in the 
estimation of molecular weights of unknown protein (New England Biolabs Inc. 
Catalog:130-131, 1995; J. L. Hartley, U.S. Patent No. 5,449,758). However, the molecular 
weight standards may not correspond closely enough in size to the unknown protein to 
allow an accurate estimation of apparent molecular weight. The difficulty in estimation of 

30 molecular weight is compounded in the case of proteins that are subjected to fragmentation 
by chemical or enzymatic means, modified by post-translational modification or processing, 
and/or associated with other proteins in non-covalent complexes. 



In addition, the unique nature of the composition of a protein with regard to its 

specific amino acid constituents results in unique positioning of cleavage sites within the 

protein. Specific fragmentation of a protein by chemical or enzymatic cleavage results in a 

unique "peptide fingerprint' 1 (D. W. Cleveland et al.,7. BioL Chem. 252:1 102-1 106, 1977; 

5 M. Brown et al.. J. Gen. Virol. 50:309-316. 19S0). Consequently, cleavage at specific sites 

results in reproducible fragmentation of a given protein into peptides of precise molecular 

weights. Furthermore, these peptides possess unique charge characteristics that determine 

the isoelectric pH of the peptide. These unique characteristics can be exploited using a 

variety of electrophoretic and other techniques (T.D. Brock and M.T. Madigan, Biology of 

10 Microorganisms 76-77 (Prentice Hall, 6d ed. 1991)). 

Fragmentation of proteins is further employed for amino acid composition analysis 

and protein sequencing (P. Matsudiara, J. BioL Chem. 262:10035-10038, 1987; C. 

Eckerskom et al., Electrophoresis 1988, 9:830-838, 1988), particularly the production of 

fragments from proteins with a "blocked" N-terminus. In addition, fragmented proteins can 

15 be used for immunization, for affinity selection (R. A. Brown, U.S. Patent No. 5,15 1,412), 

for determination of modification sites {e.g. phosphorylation), for generation of active 

biological compounds (T.D. Brock and M.T. Madigan, Biology of Microorganisms 300-301 

(Prentice Hall, 6d ed. 1991)), and for differentiation of homologous proteins (M. Brown et 

al.,y. Gen. ViroL 50:309-316, 1980). 

20 In addition, when a peptide fingerprint of an unknown protein is obtained, it can be 

compared to a database of known proteins to assist in the identification of the unknown 

protein using mass spectrometry (W.J. Henzel et aL, Proc. NatL Acad. Scu USA 90:501 1- 

5015, 1993; D. Fenyo et al., Electrophoresis 19:998-1005, 1998). A variety of computer 

software programs to facilitate these comparisons are accessible via the Internet, such as 

25 Protein Prospector (Internet site: prospector.uscf.edu), Multildent (Internet site: 

www.expasy.ch/sprot/multiident.html), PeptideSearch (Internet site: www.mann.embl- 

heiedelberg.de.. .deSearch/FR_PeptideSearch Fonn.html), and ProFound (Internet site: 

www.chait-sgi.rockefeller.edii/cgi-bin/prot-id-frag.htnil). These programs allow the user to 

specify the cleavage agent and the molecular weights of the fragmented peptides within a 

30 designated tolerance. The programs compare these molecular weights to protein molecular 

weight information stored in databases to assist in determining the identity of the unknown 

protein. Accurate information concerning the number of fragmented peptides and the 

precise molecular weight of those peptides is required for accurate identification. Therefore, 

3 



increasing the accuracy in determining the number of fragmented peptides and their 
molecular weight should result in enhanced likelihood of success in the identification of 
unknown proteins. 

In addition, peptide digests of unknown proteins can be sequenced using tandem 
5 mass spectrometry (MS/MS), and the resulting sequence searched against databases (J.K. 
Eng, et aL J. Am. Soc. Mass Spec. 5:976-989 (1994); M. Mann and M. Wilm, Anal. Chem. 
66:4390-4399 ( 1 994); J.A. Taylor and R.S. Johnson, Rapid Comm. Mass Spec. I 1 : 1067- 
1075 (1997)). Searching programs that can be used in this process exist on the Internet, 
such as Lutefisk 97 (Internet site: www. Isbc.com:70/Lutefisk97. html), and the Protein 
10 Prospector, Peptide Search and Pro Found programs described above. Therefore, adding the 
sequence of a gene and its predicted protein sequence and peptide fragments to a sequence 
database can aid in the identification of unknown proteins using tandem mass spectrometry. 

Thus, there also exists a need in the art for polypeptides suitable for use in peptide 
fragmentation studies, preferably, polypeptides that have altered expression in irritable 
1 5 bowel diseases, for use in molecular weight measurements, and for use in protein 
sequencing using tandem mass spectrometry. 

SUMMARY OF THE INVENTION 

Using the T84 model, it was determined that certain polypeptides have altered (up- 

20 regulated or down-regulated) expression patterns in response to ENF-gamma. Such 

molecules may have a role in gut barrier function and EBD and may be useful as potential 

therapeutic agents in the treatment of IBD and other gut pathologies. The invention aids in 

fulfilling these needs in the art by providing isolated nucleic acids and polypeptides encoded 

by these nucleic acids that have altered expression characteristics in the T84 gut barrier 

25 model. Particular embodiments of the invention are directed to isolated nucleic acid 

molecules comprising the DNA sequences of SEQ ED NOs:l-23 and isolated nucleic acid 

molecules encoding the amino acid sequences of SEQ ID NOs:24-33, as well as nucleic acid 

molecules complementary to these sequences. Both single-stranded and double-stranded 

RNA and DNA nucleic acid molecules are encompassed by the invention, as well as nucleic 

30 acid molecules that hybridize to a denatured, double-stranded DNA comprising all or a 

portion of SEQ ID NOs: 1-23 and/or the DNA that encodes the amino acid sequences of 

SEQ ID NOs:24-33. Also encompassed are isolated nucleic acid molecules that are derived 

by in vitro mutagenesis from nucleic acid molecules comprising sequences of SEQ ID 

4 





NOs: 1-23. that are degeWHte from nucleic acid molecules compnWTg sequences of SEQ ED 
NOs:l-23. and that are allelic variants of DNA of the invention. The invention also 
encompasses recombinant vectors that direct the expression of these nucleic acid molecules 
and host cells transformed or transfected with these vectors. 
5 In another embodiment, the invention provides an isolated nucleic acid molecule 

comprising a polynucleotide having a nucleotide sequence at least 80%, preferable 85% ? 
more preferably 90%, optimally 95%, identical to a sequence of a polynucleotide selected 
from the group consisting of: 

(a) a polynucleotide fragment of SEQ ID NO: 1-23 or a polynucleotide which is 
10 hybridizable to SEQ ID NO:l-23; 

(b) a polynucleotide encoding a polypeptide fragment of a translation of SEQ ED 
NO: 1-23 or a polypeptide fragment encoded by the cDNA sequence which is hybridizable 
to SEQ ID NO: 1-23; 

(c) a polynucleotide encoding a polypeptide epitope of a translation of SEQ ED NO: 
15 1-23 or a polypeptide epitope encoded by a cDNA sequence which is hybridizable to SEQ 

ID NO: 1-23; 

(d) a polynucleotide encoding a polypeptide of a translation of SEQ ID NO: 1-23, 
having biological activity; 

(e) a polynucleotide which is a variant of SEQ ED NO: 1-23; 

20 (0 a polynucleotide which is an allelic variant of SEQ ID NO.l-23; 

(g) a polynucleotide which encodes a species homologue of a translation of SEQ ID 
NO: 1-23; 

(h) a polynucleotide capable of hybridizing under stringent conditions to any 
one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not 

25 hybridize under stringent conditions to a nucleic acid molecule having a nucleotide 
sequence of only A residues or of only T residues. 

In another embodiment, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide having a nucleotide sequence at least 80%, preferable 85%, 
more preferably 90%, optimally 95%, identical to a sequence of a polynucleotide selected 
30 from the group consisting of: 

(a) a polynucleotide fragment of SEQ ID NO:l, 7, 8, 10, 14-17, 19, 21 or a 
polynucleotide which is hybridizable to SEQ ID NO: 1, 7, 8, 10, 14-17, 19, 21; 
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(b) a polynucleotide encoding a polypeptide fragment of a translation of SEQ ED 
NO: I, 7, 8, 10, 14-17, 19, 21 or a polypeptide fragment encoded by the cDNA sequence 
which is hybridizable to SEQ ID NO: I, 7, 8, 10, 14-17, 19, 21; 

(c) a polynucleotide encoding a polypeptide epitope of a translation of SEQ ID NO: 
5 1, 7, 8, 10, 14-17, 19, 21 or a polypeptide epitope encoded by a cDNA sequence which is 

hybridizable to SEQ ID NO: 1, 7, 8, 10, 14-17, 19, 21; 

(d) a polynucleotide encoding a polypeptide of a translation of SEQ ED NO: 1, 7, 8, 
10, 14-17, 19, 21, having biological activity; 

(e) a polynucleotide which is a variant of SEQ ID NO: 1, 7, 8, 10, 14-17, 19, 21; 

10 (0 a polynucleotide which is an allelic variant of SEQ ID NO: 1, 7, 8, 10, 14-17, 19, 

21; 

(g) a polynucleotide which encodes a species homologue of a translation of SEQ ID 
NO: 1,7,8, 10, 14-17, 19, 21 1, 7, 8, 10, 14-17, 19,21; 

(h) a polynucleotide capable of hybridizing under stringent conditions to any 
15 one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not 

hybridize under stringent conditions to a nucleic acid molecule having a nucleotide 
sequence of only A residues or of only T residues. 

In another embodiment, the invention provides an isolated nucleic acid molecule 
comprising a polynucleotide having a nucleotide sequence at least 80%, preferable 85%, 
20 more preferably 90%, optimally 95%, identical to a sequence of a polynucleotide selected 
from the group consisting of: 

(a) a polynucleotide fragment of SEQ ID NO: 2-6, 9, 1 1-13, 18, 20, 22, 23 or a 
polynucleotide which is hybridizable to SEQ ID NO:2-6, 9, 1 1-13, 18, 20, 22, 23; 

(b) a polynucleotide encoding a polypeptide fragment of a translation of SEQ ID 
25 NO: 2-6, 9, 11-13, 18, 20, 22, 23 or a polypeptide fragment encoded by the cDNA sequence 

which is hybridizable to SEQ ID NO:2-6, 9, 11-13, 18, 20, 22, 23; 

(c) a polynucleotide encoding a polypeptide epitope of a translation of SEQ ID NO: 
2-6, 9, 11-13, 18, 20, 22, 23 or a polypeptide epitope encoded by a cDNA sequence which is 
hybridizable to SEQ ID NO:2-6, 9, 1 1-13, 18, 20, 22, 23; 

30 (d) a polynucleotide encoding a polypeptide of a translation of SEQ ID NO: 2-6, 9, 

11-13, 18, 20, 22, 23, having biological activity; 

(e) a polynucleotide which is a variant of SEQ ID NO:2-6, 9, 1 1-13, 18, 20, 22, 23; 
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(0 a polynucleotide which is an allelic variant of SEQ E3nO:2-6. 9, 11-13. 18,20. 

(g) a polynucleotide which encodes a species homologue of a translation of SEQ ED 
NO: 2-6.9, 1 1-15. 18, 20, 22, 23; 
5 (h) a polynucleotide capable of hybridizing under stringent conditions to any 

one of the polynucleotides specified in (a)-(g), wherein said polynucleotide does not 
hybridize under stringent conditions to a nucleic acid molecule having a nucleotide 
sequence of only A residues or of only T residues. 

In one embodiment, the isolated nucleic acid molecule comprises a nucleotide 
10 sequence encoding a secreted protein. In preferred embodiments, the isolated nucleic acid 
molecule comprises a nucleotide sequence encoding a polypeptide chosen from the group 
consisting of: 

(a) a polypeptide having the polypeptide sequence identified as a translation of 
SEQ ED NO: 1-23; 

15 (b) a polypeptide having the polypeptide sequence of SEQ ED NO: 24-33; and 

(c) a polypeptide encoded by the cDNA which is hybridizable to SEQ ID NO: 1-23. 

Typically, the isolated nucleic acid molecule comprises the entire nucleotide sequence of 

SEQ ED NO: 1-23 or a cDNA sequence which is hybridizable to SEQ ID NO: 1-23. 

Typically the isolated nucleic acid molecule comprises sequential nucleotide deletions from 
20 portions of the nucleotide sequence encoding either the C-terminus or the N-terminus of the 

polypeptide. 

In other aspects, the invention provides a recombinant vector comprising the isolated 
nucleic acid molecule and a method of making a recombinant host cell comprising the 
isolated nucleic acid molecule and the recombinant host cell produced by such a method. 

25 

In other preferred embodiments, the invention provides an isolated polypeptide 
having an amino acid sequence at least 80%, preferably at least 85%, more preferably at 
least 90% identical to the sequence of a polypeptide selected from the group consisting of: 

(a) a polypeptide fragment of a polypeptide encoded by a polynucleotide of SEQ ID 
30 NO: 1-23; 

(b) a polypeptide having the sequence of SEQ ID NO: 24-33; 

(c) a polypeptide domain of a polypeptide encoded by a polynucleotide of SEQ ID 
NO: 1-23; 
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(d) a polypeptide epitope of a polypeptide encoded by a polynucleotide of SEQ [D 
NO: 1-23: 

(e) a secreted form of a polypeptide encoded by a polynucleotide of SEQ ID NO: 1 

23; 

(0 a full length protein of a polypeptide encoded by a polynucleotide of SEQ ID 
NO: 1-23; 

(g) a variant of a polypeptide encoded by a polynucleotide of SEQ ID NO: 1-23; 

(h) an allelic variant of a polypeptide encoded by a polynucleotide of SEQ ED NO: 
1-23; and 

(i) a species homologue of a polypeptide encoded by a polynucleotide of SEQ ID 
NO: 1-23. 



In some embodiments, the full length polypeptide comprises sequential amino acid 
deletions from the C-terminus. In other embodiments, the mature polypeptide comprises 
sequential amino acid deletions from the C-terminus. Alternatively, the full length 
polypeptide can comprise sequential amino acid deletions from the N-terminus or the 
mature polypeptide can comprise sequential amino acid deletions from the N-terminus. 



In other aspects, the invention provides an isolated antibody that binds specifically to 
the isolated polypeptide, a recombinant host cell that expresses the isolated polypeptide, and 
a method of making an isolated polypeptide comprising culturing the recombinant host cell 
under conditions such that said polypeptide is expressed; and recovering said polypeptide. 



The invention in another embodiment is a method for preventing, treating, or 

ameliorating irritable bowel discorders, comprising administering to a mammalian subject a 

therapeutically effective amount of the isolated polypeptide or the isolated nucleic acid 

molecule. In another embodiment, the invention provides a method of diagnosing an 

irritable bowel disease or a susceptibility to irritable bowel disease in a subject comprising: 

determining the presence or absence of a mutation in the isolayed nucleic acid molecule, and 

diagnosing an irritable bowel disease or a susceptibility to irritable bowel disease based on 

the presence or absence of said mutation. Alternatively, irritable bowel disease or a 

susceptibility to irritable bowel disease in a subject can be diagnosed by a method 

comprising: determining the presence or amount of expression of the polypeptide in a 
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biological sample; and diagnosing irritable bowel disease or a susceptibility to irritable 
bowel disease based on the presence or amount of expression of the polypeptide. 

Typically, a binding partner of the polypeptide is identified by contacting the 
polypeptide with a binding partner; and determining whether the binding partner affects a 
5 physical property or an activity of the polypeptide. Typically activity in a biological assay 
of a secreted polypeptide is identified by expressing the polynucleotide of SEQ ID NO: 1-23 
in a cell; isolating the supernatant; detecting an activity in a biological assay; and 
identifying the polypeptide in the supernatant having the activity. 



10 In addition, the invention encompasses methods of using the nucleic acids noted 

above to identify nucleic acids encoding proteins homologous to SEQ ID NOs:24-33; to 
identify human chromosomes that contain the nucleotide sequences of the invention; to map 
genes near the nucleotide sequences of the invention on human chromosomes; and to 
identify genes associated with certain diseases, syndromes, or other human conditions 

15 associated with human chromosomes containing sequences of the invention. 

For example, four of the nucleotide sequences of the invention, IMX4 and IMX56, 
IMX21, and IMX44 are located on chromosomes 22, 22, 7, an 9, respectively (IMX4 and 
IMX56 are both located on chromosome 22). Thus, the above-named nucleotide sequences 
(1MX4, IMX56, EMX21, and IMX44) can be used to identify human chromosome numbers 

20 22, 7, and 19; to map genes on human chromosome numbers 22, 7, and 19; and to identify 
genes associated with certain diseases, syndromes, or other human conditions associated 
with human chromosome numbers 22, 7, and 19. 

The invention also encompasses the use of sense or anti sense oligonucleotides from 
the nucleic acids of SEQ ID NOs:l-23 to inhibit the expression of the polynucleotides 

25 encoded by the nucleotide sequences of the invention. 

The invention also encompasses isolated polypeptides and fragments thereof 
encoded by these nucleic acid molecules including soluble polypeptide portions of SEQ ID 
NOs:24-33. The invention further encompasses methods for the production of these 
polypeptides, including culturing a host cell under conditions promoting expression and 

30 recovering the polypeptide from the culture medium if it is secreted or from cultured cells if 
it is not secreted. Especially, the expression of these polypeptides in bacteria, yeast, plant, 
insect, and animal cells is encompassed by the invention. 
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In addition, the invention includes assays utilizing these polypeptides, to screen for 
potential inhibitors of activity associated with polypeptide counter-structure molecules, and 
methods of using these polypeptides as therapeutic agents for the treatment of diseases 
mediated by polypeptide counter-structure molecules. Further, methods of using these 
5 polypeptides in the design of inhibitors thereof are also an aspect of the invention. 

The invention further includes a method for using these polypeptides as molecular 
weight markers that allow the estimation of the molecular weight of a protein or a 
fragmented protein, as well as a method for the visualization of the molecular weight 
markers of the invention thereof using electrophoresis. The invention further encompasses 
10 methods for using the polypeptides of the invention as markers fcr determining the 

isoelectric point of an unknown protein, as well as controls for establishing the extent of 
fragmentation of a protein. 

Further encompassed by this invention are kits to aid in these determinations. 

Further encompassed by this invention is the use of the EMX nucleic acid sequences, 
1 5 predicted amino acid sequences of the polypeptide or fragments thereof, or a combination of 
the predicted amino acid sequences of the polypeptides and fragments thereof for use in 
searching an electronic database to aid in the identification of sample nucleic acids and/or 
proteins. 

The invention also encompasses IMX polypeptides and the use of these polypeptides 
20 as research reagents to further study gut epithelial barrier function and regulation and 
therapeutic reagents to treat inflammatory bowel disease and other gut pathologies. 

Isolated polyclonal or monoclonal antibodies that bind to these polypeptides are also 
encompassed by the invention, in addition the use of these antibodies to aid in purifying 
IMX polypeptides. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 presents the nucleotide sequence of IMX 4 (SEQ ED NO: 1). 

Figure 2 presents the nucleotide sequence of EMX 10 (SEQ ID NO: 14). 

Figure 3 presents the nucleotide sequence of IMX 21 (SEQ ID NO: 15). 

30 Figure 4 presents the nucleotide sequence of IMX 28 (SEQ ID NO: 16). 

Figure 5 presents the nucleotide sequence of EMX 32 (SEQ ID NO: 17). 

Figure 6 presents the nucleotide sequence of IMX 39 (SEQ ID NO: 19). 

Figure 7 presents the nucleotide sequence of IMX 40 (SEQ ID NO: 7). 
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Figure 8 preseru^he nucleotide sequence of IMX 42 (SEQ ID NO: 8). 

Figure 9 presents the nucleotide sequence of IMX 44 (SEQ ID NO: 21). 

Figure 10 presents the nucleotide sequence of IMX 56 (SEQ CD NO: 10). 

Figure 1 1 presents the amino acid sequence of [MX 4 (SEQ ED NO: 24). 
5 Figure 12 presents the amino acid sequence of EMX 10 (SEQ ID NO: 25). 

Figure 13 presents the amino acid sequence of EMX 21 (SEQ ID NO: 26). 

Figure 14 presents the amino acid sequence of IMX 28 (SEQ ED NO: 27). 

Figure 15 presents the amino acid sequence of IMX 32 (SEQ ED NO: 28). 

Figure 16 presents the amino acid sequence of IMX 39 (SEQ ED NO: 29). 
10 Figure 17 presents the amino acid sequence of IMX 40 (SEQ ED NO: 30). 

Figure 18 presents the amino acid sequence of IMX 42 (SEQ ID NO: 31). 

Figure 19 presents the amino acid sequence of IMX 44 (SEQ ID NO: 32). 

Figure 20 presents the amino acid sequence of IMX 56 (SEQ ID NO: 33). 

Figure 21 presents the nucleotide sequence of the 5' end of the clone (SEQ ID 
15 NO: 13) matched to part of the human ApoL gene (AF0 19225). 

Figure 22 presents comparison of the 5* end of the clone suggests it represents an 
alternative splice product to reported ApoL, i.., bases 1-168 match to 2 exons on PAC 
carrying the ApoL gene but are not included in the reported complete cDNA. 
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«S31RAPCT - 10 MAY 2001 
ED DESCRIPTION OF THE INVENTION 

Definitions 

The following definitions are provided to facilitate understanding of certain terms 
used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is altered 
"by the hand of man" from its natural state. For example, an isolated polynucleotide could 
be part of a vector or a composition of matter, or could be contained within a cell, and still 
be "isolated" because that vector, composition of matter, or particular cell is not the original 
environment of the polynucleotide. 

In the present invention, a "secreted" protein refers to those proteins capable of being 
directed to the ER, secretory vesicles, or the extracellular space as a result of a signal 
sequence, as well as those proteins released into the extracellular space without necessarily r ~ 
containing a signal sequence. If the secreted protein is released into the extracellular space, 
the secreted protein can undergo extracellular processing to produce a "mature" protein. 
Release into the extracellular space can occur by many mechanisms, including exocytosis 
and proteolytic cleavage. 

As used herein, a "polynucleotide" refers to a molecule having a nucleic acid 
sequence contained in SEQ ED NO: 1-23. For example, the polynucleotide can contain all or 
part of the nucleotide sequence of the full length cDNA sequence, including the 5' and 3' 
untranslated sequences, the coding region, with or without the signal sequence, the secreted 
protein coding region, as well as fragments, epitopes, domains, and variants of the nucleic 
acid sequence. Moreover, as used herein, a "polypeptide" refers to a molecule having the 
translated amino acid sequence generated from the polynucleotide as broadly defined. 

A "polynucleotide" of the present invention also includes those polynucleotides 
capable of hybridizing, under stringent hybridization conditions, to sequences contained in 
SEQ ED NO: 1-23, or the complement thereof, or the cDNA. "Stringent hybridization 
conditions" refers to an overnight incubation at 42° C in a solution comprising 50% 
formamide, 5x SSC (750 mM NaCl, 75 mM sodium citrate), 50 mM sodium phosphate (pH 
7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 (ig/ml denatured, sheared salmon 
sperm DNA, followed by washing the filters in 0.1 x SSC at about 65°C. 
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Also contemplated are nucleic acid molecules that hybridize to the polynucleotides 
of the present invention at lower stringency hybridization conditions. Changes in the 
stringency of hybridization and signal detection are primarily accomplished through the 
manipulation of formamide concentration (lower percentages of formamide result in 
5 lowered stringency); salt conditions, or temperature. For example, lower stringency 

conditions include an overnight incubation at 37°C in a solution comprising 6X SSPE (20X 
SSPE = 3M NaCl; 0.2M NaH 2 PO.t: 0.02M EDTA, pH 7.4), 0.5% SDS, 30% formamide, 
100 ug/ml salmon sperm blocking DNA; 

followed by washes at 50°C with 1XSSPE, 0.1% SDS. In addition, to achieve even lower 

1 0 stringency, washes performed following stringent hybridization can be done at higher salt 
concentrations (e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress background in 
hybridization experiments. Typical blocking reagents include Denhardt's reagent, 

15 BLOTTO, heparin, denatured salmon sperm DNA, and commercially available proprietary 
formulations. The inclusion of specific blocking reagents may require modification of the 
hybridization conditions described above, due to problems with compatibility. 

Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as any 
3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a complementary 

20 stretch of T (or U) residues, would not be included in the definition of "polynucleotide," 

since such a polynucleotide would hybridize to any nucleic acid molecule containing a poly 
(A) stretch or the complement thereof (e.g., practically any double-stranded cDNA clone). 

The polynucleotide of the present invention can be composed of any 
polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA or 

25 modified RNA or DNA. For example, polynucleotides can be composed of single- and 
double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, 
single- and double-stranded RNA, and RNA that is mixture of single- and double-stranded 
regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more 
typically, double-stranded or a mixture of single- and double-stranded regions. In addition, 

30 the polynucleotide can be composed of triple-stranded regions comprising RNA or DNA or 

both RNA and DNA. A polynucleotide may also contain one or more modified bases or 

DNA or RNA backbones modified for stability or for other reasons. "Modified" bases 

include, for example, tritylated bases and unusual bases such as inosine. A variety of 
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naffe to DNA and RNA: thus. "DoKnucIeouae" 



modifications can be mlffe to DNA and RNA; thus. "poKnucIeoune" embraces chemically, 
enzvmaticallv, or metabolicallv modified forms. 

The polypeptide of the present invention can be composed of amino acids joined to 
each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may 
5 contain amino acids other than the 20 gene-encoded amino acids. The polypeptides may be 
modified by either natural processes, such as posttranslational processing, or by chemical 
modification techniques which are well known in the art. Such modifications are well 
described in basic texts and in more detailed monographs, as well as in a voluminous 
research literature. Modifications can occur anywhere in a polypeptide, including the 

10 peptide backbone, the amino acid side-chains and the amino or carboxyl termini. It will be 
appreciated that the same type of modification may be present in the same or varying 
degrees at several sites in a given polypeptide. Also, a given polypeptide may contain many 
types of modifications. Polypeptides may be branched, for example, as a result of 
ubiquitination, and they may be cyclic, with or without branching. Cyclic, branched, and 

1 5 branched cyclic polypeptides may result from posttranslation natural processes or may be 

made by synthetic methods. Modifications include acetylation, acylation, ADP-ribosylation, 
amidation, covalent attachment of flavin, covalent attachment of a heme moiety, covalent 
attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid 
derivative, covalent attachment of phosphotidylinositol, cross-linking, cyclization, disulfide 

20 bond formation, demethylation, formation of covalent cross-links, formation of cysteine, 
formation of pyroglutamate, formulation, gamma-carboxylation, glycosylation, GPI anchor 
formation, hydroxylation, iodination, methylation, myristoylation, oxidation, pegylation, 
proteolytic processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, 
transfer- RNA mediated addition of amino acids to proteins such as arginylation, and 

25 ubiquitination. (See, for instance, PROTEINS - STRUCTURE AND MOLECULAR 

PROPERTIES, 2nd Ed., T. E. Creighton, W. H. Freeman and Company, New York (1993); 
POSTTRANSLATIONAL COVALENT MODIFICATION OF PROTEINS, B. C. Johnson, 
Ed., Academic Press, New York, pgs. 1-12 (1983); Seifter et al., Meth Enzymol 182:626- 
646 (1990); Rattan et al., Ann NY Acad Sci 663:48-62 (1992).) 

30 "A polypeptide having biological activity" refers to polypeptides exhibiting activity 

similar, but not necessarily identical to, an activity of a polypeptide of the present invention, 
including mature forms, as measured in a particular biological assay, with or without dose 
dependency. In the case where dose dependency does exist, it need not be identical to that 

* 
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of the polypeptide, but rather substantially similar to the dose-dependence in a given activity 
as compared to the polypeptide of the present invention (i.e., the candidate polypeptide will 
exhibit greater activity or not more than about 25-fold less and, preferably, not more than 
about tenfold less activity, and most preferably, not more than about three-fold less activity 
5 relative to the polypeptide of the present invention.)- 

The translated amino acid sequence, beginning with the methionine, is identified 
although other reading frames can also be easily translated using known molecular biology 
techniques. The polypeptides produced by the translation of these alternative open reading 
frames are specifically contemplated by the present invention. 

10 SEQ ID NO:l-23 and the translations of SEQ ED NO: 1-23 as well as SEQ ID 

NO:24-33 are sufficiently accurate and otherwise suitable for a variety of uses well known 
in the art and described further below. These probes will also hybridize to nucleic acid 
molecules in biological samples, thereby enabling a variety of forensic and diagnostic 
methods of the invention. Similarly, polypeptides identified from the translations of SEQ 

15 ID NO: 1-23 may be used to generate antibodies which bind specifically to the secreted 
proteins encoded by the cDNA clones identified. 

Nevertheless, DNA sequences generated by sequencing reactions can contain 
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or deletions 
of nucleotides in the generated DNA sequence. The erroneously inserted or deleted 

20 nucleotides cause frame shifts in the reading frames of the predicted amino acid sequence. 
In these cases, the predicted amino acid sequence diverges from the actual amino acid 
sequence, even though the generated DNA sequence may be greater than 99.9% identical to 
the actual DNA sequence (for example, one base insertion or deletion in an open reading 
frame of over 1000 bases). 

25 The present invention also relates to the genes corresponding to SEQ ID NO: 1-23, 

and translations of SEQ ID NO: 1-23. The corresponding gene can be isolated in 
accordance with known methods using the sequence information disclosed herein. Such 
methods include preparing probes or primers from the disclosed sequence and identifying or 
amplifying the corresponding gene from appropriate sources of genomic material. 

30 Also provided in the present invention are species homologies. Species 

homologues may be isolated and identified by making suitable probes or primers from the 

* 

sequences provided herein and screening a suitable nucleic acid source for the 
homologue. 
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The poIypeptideFof the invention can be prepared in any suitable manner. Such 
pol\peptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a 
combination of these methods. Means for preparing such polypeptides are well understood 
in the an. 

The polypeptides may be in the form of the secreted protein, including the mature 
form, or may be a part of a larger protein, such as a fusion protein (see below). It is often 
advantageous to include an additional amino acid sequence which contains secretory or 
leader sequences, pro-sequences, sequences which aid in purification, such as multiple 
histidine residues, or an additional sequence for stability during recombinant production. 

The polypeptides of the present invention are preferably provided in an isolated 
form, and preferably are substantially purified. A recombinantly produced version of a 
polypeptide, including the secreted polypeptide, can be substantially purified by the one-step 
method described in Smith and Johnson, Gene 67:3 1-40 (1988). Polypeptides of the 
invention also can be purified from natural or recombinant sources using antibodies of the 
invention raised against the secreted protein in methods which are well known in the art. 

Signal Sequences 

Methods for predicting whether a protein has a signal sequence, as well as the 
cleavage point for that sequence, are available. For instance, the method of McGeoch, Virus 
Res. 3:271-286 (1985), uses the information from a short N-terminal charged region and a 
subsequent uncharged region of the complete (uncleaved) protein. The method of von 
Heinje, Nucleic Acids Res. 14:4683-4690 (1986) uses the information from the residues 
surrounding the cleavage site, typically residues -13 to +2, where +1 indicates the amino 
terminus of the secreted protein. Therefore, from a deduced amino acid sequence, a signal 
sequence and mature sequence can be identified. 

Polynucleotide and Polypeptide Variants 

"Variant" refers to a polynucleotide or polypeptide differing from the polynucleotide 
or polypeptide of the present invention, but retaining essential properties thereof Generally, 
variants are overall closely similar, and, in many regions, identical to the polynucleotide or 
polypeptide of the present invention. 
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rnWTs of the present invention include doI^tocI 



Further embodim^ffs of the present invention include pol^rocleotides having at 
least 80% identity, more preferably at least 90% identity, and most preferably at least 95%, 
96%, 97%, 98% or 99% identity to a sequence contained in SEQ ED NO: 1-23. Of course, 
due to the degeneracy of the genetic code, one of ordinary skill in the art will immediately 
5 recognize that a large number of the polynucleotides having at least 85%, 90%, 95%, 96° o, 
97%, 98%, or 99% identity will encode a polypeptide identical to an amino acid sequence 
contained in the translations of SEQ ED NO: 1-23. 

Similarly, by a polypeptide having an amino acid sequence having at least, for 
example, 95% "identity" to a reference polypeptide, is intended that the amino acid 

10 sequence of the polypeptide is identical to the reference polypeptide except that the 

polypeptide sequence may include up to five amino acid alterations per each 100 amino 
acids of the total length of the reference polypeptide. In other words, to obtain a polypeptide 
having an amino acid sequence at least 95% identical to a reference amino acid sequence, up 
to 5% of the amino acid residues in the reference sequence may be deleted or substituted 

1 5 with another amino acid, or a number of amino acids up to 5% of the total amino acid 
residues in the reference sequence may be inserted into the reference sequence. These 
alterations of the reference sequence may occur at the amino or carboxy terminal positions 
of the reference amino acid sequence or anywhere between those terminal positions, 
interspersed either individually among residues in the reference sequence or in one or more 

20 contiguous groups within the reference sequence. 

Further embodiments of the present invention include polypeptides having at least 
80% identity, more preferably at least 85% identity, more preferably at least 90% identity, 
and most preferably at least 95%, 96%, 97%, 98% or 99% identity to an amino acid 
sequence contained in translations of SEQ ED NO: 1-23. Preferably, the above polypeptides 

25 should exhibit at least one biological activity of the protein. 

In a preferred embodiment, polypeptides of the present invention include 
polypeptides having at least 90% similarity, more preferably at least 95% similarity, and still 
more preferably at least 96%, 97%, 98%, or 99% similarity to an amino acid sequence 
contained in translations of SEQ ED NO: 1-23 as well as the amino acid sequences of SEQ 

30 ID NO:24-33. 

The variants may contain alterations in the coding regions, non-coding regions, or 

both. Especially preferred are polynucleotide variants containing alterations which produce 

silent substitutions, additions, or deletions, but do not alter the properties or activities of the 

17 



encoded polypeptide. Nucleotide variants produced by silent substitutions due to the 

degeneracy of the genetic code are preferred. Moreover, variants in which 5-10. 1-5, or 1-2 

amino acids are substituted, deleted, or added in any combination are also preferred. 

Polynucleotide variants can be produced for a variety of reasons, e.g., to optimize codon 

5 expression for a particular host (change codons in the human mRNA to those preferred by a 

bacterial host such as £. coli). 

Naturally occurring variants are called "allelic variants," and refer to one of several 

alternate forms of a gene occupying a given locus on a chromosome of an organism. (Genes 

II, Lewin, B., ed., John Wiley & Sons, New York (1985).) These allelic variants can vary at 

10 either the polynucleotide and/or polypeptide level. Alternatively, non-naturally occurring 

variants may be produced by mutagenesis techniques or by direct synthesis. 

Using known methods of protein engineering and recombinant DNA technology, 

variants may be generated to improve or alter the characteristics of the polypeptides of the 

present invention. For instance, one or more amino acids can be deleted from the N- 

15 terminus or C-terminus of the secreted protein without substantial loss of biological 

function. The authors of Ron et al., J. Biol. Chem. 268: 2984-2988 (1993) reported variant 

KGF proteins having heparin binding activity even after deleting 3, 8, or 27 amino-terminal 

amino acid residues. Similarly, Interferon gamma exhibited up to ten times higher activity 

after deleting 8-10 amino acid residues from the carboxy terminus of this protein. (Dobeli et 

20 al., J. Biotechnology 7:199-216 (1988).) 

Moreover, ample evidence demonstrates that variants often retain a biological 

activity similar to that of the naturally occurring protein. For example, Gayle and coworkers 

(J. Biol. Chem 268:22105-221 1 1 (1993)) conducted extensive mutational analysis of 

human cytokine IL-l a. They used random mutagenesis to generate over 3,500 individual 

25 EL- 1 a mutants that averaged 2.5 amino acid changes per variant over the entire length of the 

molecule. Multiple mutations were examined at every possible amino acid position. The 

investigators found that "[m]ost of the molecule could be altered with little effect on either 

[binding or biological activity]." (See Gayle et al., (1993), Abstract.) In fact, only 23 unique 

amino acid sequences, out of more than 3,500 nucleotide sequences examined, produced a 

30 protein that significantly differed in activity from wild-type. 

Furthermore, even if deleting one or more amino acids from the N-terminus or C- 

terminus of a polypeptide results in modification or loss cf one or more biological functions, 

other biological activities may still be retained. For example, the ability of a deletion 
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variant to induce and'or to bind antibodies which recognize the secreted form will likely be 
retained when less than the majority of the residues of the secreted form are removed from 
the N-tenninus or C-tenminus. Whether a particular polypeptide lacking N- or C-terminal 
residues of a protein retains such immunogenic activities can readily be determined by 
5 routine methods described herein and otherwise known in the art. 

Thus, the invention further includes polypeptide variants which show substantial 
biological activity. Such variants include deletions, insertions, inversions, repeats, and 
substitutions selected according to general rules known in the art so as have little effect on 
activity. For example, guidance concerning how to make phenotypically silent amino acid 
10 substitutions is provided in Bowie, J. U. et al., Science 247:1306-1310 (1990), wherein the 
authors indicate that there are two main strategies for studying the tolerance of an amino 
acid sequence to change. As the authors state, these two strategies, natural selection and 
genetic engineering, have revealed that proteins are surprisingly tolerant of amino acid 
substitutions. The authors further indicate which amino acid changes are likely to be 
15 permissive at certain amino acid positions in the protein. For example, most buried (within 
the tertiary structure of the protein) amino acid residues require nonpolar side chains, 
whereas few features of surface side chains are generally conserved. Moreover, tolerated 
conservative amino acid substitutions involve replacement of the aliphatic or hydrophobic 
amino acids Ala, Val, Leu and He; replacement of the hydroxyl residues Ser and Thr; 
20 replacement of the acidic residues Asp and Glu; replacement of the amide residues Asn and 
Gin, replacement of the basic residues Lys, Arg, and His; replacement of the aromatic 
residues Phe, Tyr, and Tip, and replacement of the small-sized amino acids Ala, Ser, Thr, 
Met, and Gly. 

Besides conservative amino acid substitution, variants of the present invention 
25 include (i) substitutions with one or more of the non-conserved amino acid residues, where 
the substituted amino acid residues may or may not be one encoded by the genetic code, or 
(ii) substitution with one or more of amino acid residues having a substituent group, or (iii) 
fusion of the mature polypeptide with another compound, such as a compound to increase 
the stability and/or solubility of the polypeptide (for example, polyethylene glycol), or (iv) 
30 fusion of the polypeptide with additional amino acids, such as an IgG Fc fusion region 

peptide, or leader or secretory sequence, or a sequence facilitating purification. Such variant 
polypeptides are deemed to be within the scope of those skilled in the art from the teachings 
herein. 
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For example, polypeptide variants containing amino acid substitutions of charged 
amino acids with other charged or neutral amino acids may produce proteins with improved 
characteristics, such as less aggregation. Aggregation of pharmaceutical formulations both 
reduces activity and increases clearance due to the aggregate's immunogenic activity. 
(Pinckard et al, Clin. Exp. Immunol. 2:331-340 (1967); Robbins et al., Diabetes 36: 838- 
845 (1987); Cleland et al., Crit. Rev. Therapeutic Drug Carrier Systems 10:307-377 
(1993).) 



Polynucleotide and Polypeptide Fragments 

10 In the present invention, a "polynucleotide fragment" refers to a short polynucleotide 

having a nucleic acid sequence contained in that shown in SEQ ID NO: 1-23. The short 
nucleotide fragments are preferably at least about 15 nt, and more preferably at least about 
20 nt, still more preferably at least about 30 nt, and even more preferably, at least about 40 
nt in length. A fragment "at least 20 nt in length," for example, is intended to include 20 or 

15 more contiguous bases from the cDNA sequence contained in that shown in SEQ ID NO: 1- 
23. These nucleotide fragments are useful as diagnostic probes and primers as discussed 
herein. Of course, larger fragments (e.g., 50, 150, and more nucleotides) are preferred. 

Moreover, representative examples of polynucleotide fragments of the invention, 
include, for example, fragments having a sequence from about nucleotide number 1-50, 51- 

20 230, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401-450, and so forth, to the 
end of SEQ ID NO: 1-23. In this context "about" includes the particularly recited ranges, 
larger or smaller by several (5, 4, 3, 2, or 1) nucleotides, at either terminus or at both 
termini. Preferably, these fragments encode a polypeptide which has biological activity. 
In the present invention, a "polypeptide fragment" refers to a short amino acid 

25 sequence contained in the translations of SEQ ED NO: 1-23 as well as SEQ ID NO:24-33. 
Protein fragments may be "free-standing," or comprised within a larger polypeptide of 
which the fragment forms a part or region, most preferably as a single continuous region. 
Representative examples of polypeptide fragments of the invention, include, for example, 
fragments from about amino acid number 1-20, 21-40, 41-60, and so forth, to the end of the 

30 coding region. Moreover, polypeptide fragments can be about 20, 30, 40, 50 or 60, amino 
acids in length. In this context "about" includes the particularly recited ranges, larger or 
smaller by several (5, 4, 3, 2, or 1) amino acids, at either extreme or at both extremes. 
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Preferred poKp^tide fragments include the secreted protein as well as the mature 
fo rm. Further preferred polypeptide fragments include the secreted protein or the mature 
form having a continuous series of deleted residues from the amino or the carboxy terminus, 
or both. For example, any number of amino acids, ranging from 1-60, can be deleted from 
5 the amino terminus of either the secreted polypeptide or the mature form. Similarly, any 
number of amino acids, ranging from 1-30, can be deleted from the carboxy. terminus of the 
secreted protein or mature form. Furthermore, any combination of the above amino and 
carboxy terminus deletions are preferred. Similarly, polynucleotide fragments encoding 
these polypeptide fragments are also preferred. 
!0 Also preferred are polypeptide and polynucleotide fragments characterized by 

structural or functional domains, such as fragments that comprise alpha-helix and alpha- 
helix forming regions, beta-sheet and beta-sheet-forming regions, turn and turn- forming 
regions, coil and coil-forming regions, hydrophilic regions, hydrophobic regions, alpha 
amphipathic regions, beta amphipathic regions, flexible regions, surface-forming regions, 
15 substrate binding region, and high antigenic index regions. Polypeptide fragments of the 
translations of SEQ ID NO: 1-23 as well as SEQ ED NO:24-33 falling within conserved 
domains are specifically contemplated by the present invention. Moreover, polynucleotide 
fragments encoding these domains are also contemplated. 

Other preferred fragments are biologically active fragments. Biologically active 
20 fragments are those exhibiting activity similar, but not necessarily identical, to an activity of 
the polypeptide of the present invention. The biological activity of the fragments may 
include an improved desired activity, or a decreased undesirable activity. 



Epitopes 8l Antibodies 

25 In the present invention, "epitopes" refer to polypeptide fragments having antigenic 

or immunogenic activity in an animal, especially in a human. A preferred embodiment of 
the present invention relates to a polypeptide fragment comprising an epitope, as well as the 
polynucleotide encoding this fragment. A region of a protein molecule to which an antibody 
can bind is defined as an "antigenic epitope." In contrast, an "immunogenic epitope" is 

30 defined as a part of a protein that elicits an antibody response. (See, for instance, Geysen et 
al.,Proc. Natl. Acad. Sci. USA 81:3998-4002 (1983).) 
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Fragments whiartunction as epitopes may be produced by any conventional means. 
(See, e.g., Houghten. R. A.. Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985) further 
described in U.S. Patent No. 4,631.21 1.) 

In the present invention, antigenic epitopes preferably contain a sequence of at least 
5 seven, more preferably at least nine, and most preferably between about 1 5 to about 30 
amino acids. Antigenic epitopes are useful to raise antibodies, including monoclonal 
antibodies, that specifically bind the epitope. (See, for instance, Wilson et al., Cell 37:767- 
778 (1984); Sutcliffe, J. G. et aL, Science 219:660-666 (1983).) 

Similarly, immunogenic epitopes can be used to induce antibodies according to 
10 methods well known in the art. (See, for instance, Sutcliffe et al., supra; Wilson et aL, supra; 
Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et aL, J. Gen. 
Virol. 66:2347-2354 (1985).) A preferred immunogenic epitope includes the secreted 
protein. The immunogenic epitopes may be presented together with a carrier protein, such 
as an albumin, to an animal system (such as rabbit or mouse) or, if it is long enough (at least 
15 about 25 amino acids), without a carrier. However, immunogenic epitopes comprising as 
few as 8 to 10 amino acids have been shown to be sufficient to raise antibodies capable of 
binding to, at the very least, linear epitopes in a denatured polypeptide (e.g., in Western 
blotting.) 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant 
20 to include intact molecules as well as antibody fragments (such as, for example, Fab and 
F(ab')2 fragments) which are capable of specifically binding to protein. Fab and F(ab')2 
fragments lack the Fc fragment of intact antibody, clear more rapidly from the circulation, 
and may have less non-specific tissue binding than an intact antibody. (Wahl et al., J. Nucl. 
Med. 24:316-325 (1983).) Thus, these fragments are preferred, as well as the products of a 
25 FAB or other immunoglobulin expression library. Moreover, antibodies of the present 
invention include chimeric, single chain, and humanized antibodies. 



Fusion Proteins 

Any polypeptide of the present invention can be used to generate fusion proteins. 

30 For example, the polypeptide of the present invention, when fused to a second protein, can 

be used as an antigenic tag. Antibodies raised against the polypeptide of the present 

invention can be used to indirectly detect the second protein by binding to the polypeptide. 

Moreover, because secreted proteins target cellular locations based on trafficking signals, 
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the pol>peptides of th^Tresent invention can be used as targeting molecules once fused to 
other proteins. 

Examples of domains that can be fused to polypeptides of the present invention 
include not only heterologous signal sequences, but also other heterologous functional 
5 regions. The fusion does not necessarily need to be direct, but may occur through linker 
sequences. 

Moreover, fusion proteins may also be engineered to improve characteristics of the 
polypeptide of the present invention. For instance, a region of additional amino acids, 
particularly charged amino acids, may be added to the N-terminus of the polypeptide to 

10 improve stability and persistence during purification from the host cell or subsequent 

handling and storage. Also, peptide moieties may be added to the polypeptide to facilitate 
purification. Such regions may be removed prior to final preparation of the polypeptide. 
The addition of peptide moieties to facilitate handling of polypeptides are familiar and 
routine techniques in the art. 

15 Moreover, polypeptides of the present invention, including fragments, and 

specifically epitopes, can be combined with parts of the constant domain of 
immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins facilitate 
purification and show an increased half-life in vivo. One reported example describes 
chimeric proteins consisting of the first two domains of the human CD4-polypeptide and 

20 various domains of the constant regions of the heavy or light chains of mammalian 

immunoglobulins. (EP A 394,827; Traunecker et al., Nature 331:84-86 (1988).) Fusion 
proteins having disulfide-linked dimeric structures (due to the IgG) can also be more 
efficient in binding and neutralizing other molecules, than the monomelic secreted protein 
or protein fragment alone. (Fountoulakis et al., J. Biochem. 270:3958-3964 (1995).) 

25 Similarly, EP-A-0 464 533 (Canadian counterpart 2045869) discloses fusion 

proteins comprising various portions of constant region of immunoglobulin molecules 
together with another human protein or part thereof. In many cases, the Fc part in a fusion 
protein is beneficial in therapy and diagnosis, and thus can result in* for example, improved 
pharmacokinetic properties. (EP-A 0 232 262.) Alternatively, deleting the Fc part after the 

30 fusion protein has been expressed, detected, and purified, would be desired. For example, 

the Fc portion may hinder therapy and diagnosis if the fusion protein is used as an antigen 

for immunizations. In drug discovery, for example, human proteins, such as hIL-5, have 

been fused with Fc portions for the purpose of high-throughput screening assays to identify 
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antagonists of hEL-5. (See, D. Bennett et ah, J. Molecular Recognition 8:52-58 (1995); K. 
Johanson et aL J. Biol. Chem. 270:9459-9471 (1995).) 

Moreover, the polypeptides of the present invention can be fused to marker 
sequences, such as a peptide which facilitates purification of the fused polypeptide. In 
5 preferred embodiments, the marker amino acid sequence is a hexa-histidine peptide, such as 
the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 
91311), among others, many of which are commercially available. As described in Gentz et 
al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for instance, hexa-histidine provides 
for convenient purification of the fusion protein. Another peptide tag useful for purification, 
10 the "HA" tag, corresponds to an epitope derived from the influenza hemagglutinin protein. 
(Wilson et al., Cell 37:767 (1984).) 

Thus, any of these above fusions can be engineered using the polynucleotides 
or the polypeptides of the present invention. 

15 Vectors, Host Cells, and Protein Production 

The present invention also relates to vectors containing the polynucleotide of the 
present invention, host cells, and the production of polypeptides by recombinant techniques. 
The vector may be, for example, a phage, plasmid, viral, or retroviral vector. Retroviral 
vectors may be replication competent or replication defective. In the latter case, viral 
20 propagation generally will occur only in complementing host cells. 

Nucleic Acid Molecules and Polypeptides of the Present Invention 

The nucleic acid molecules encompassed in the invention comprise nucleotide 

sequences SEQ ED NO: 1-23. 

25 The amino acid sequences of the polypeptides encoded by the nucleotide sequences 

of the invention are given in SEQ ID NO:24-33. 

The discovery of the nucleic acids of the invention enables the construction of 

expression vectors comprising nucleic acid sequences encoding polypeptides; host cells 

transfected or transformed with the expression vectors; isolated and purified biologically 

30 active polypeptides and fragments thereof; the use of the nucleic acids or oligonucleotides 

thereof as probes to identify nucleic acids encoding proteins having amino acid sequences 

homologous to SEQ ID NOs: 24-33; the use of the nucleic acids or oligonucleotides thereof 

to identify human chromosomes, for example, 22, 7, and 19; the use of the nucleic acids or 
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oligonucleotides thereof to map genes on human chromosomes, for example, 22, 7, and 19; 
the use of the nucleic acids or oligonucleotides thereof to identify genes associated with 
certain diseases, syndromes or other human conditions associated with human chromosomes 
such as 22, 7, and 19; the use of single-stranded sense or antisense oligonucleotides from the 
5 nucleic acids to inhibit expression of polynucleotides encoded by the [MX sequences; the 
use of such polypeptides and soluble fragments as molecular weight markers; the use of 
such polypeptides and fragmented peptides as controls for peptide fragmentation, and kits 
comprising these reagents; the use of such polypeptides and fragments thereof to generate 
antibodies, and the use of antibodies to purify EMX polypeptides. 

10 

NUCLEIC ACID MOLECULES 

In a particular embodiment, the invention relates to certain isolated nucleotide 
sequences that are free from contaminating endogenous material. A "nucleotide sequence" 
refers to a polynucleotide molecule in the form of a separate fragment or as a component of 

15 a larger nucleic acid construct. The nucleic acid molecule has been derived from DNA or 
RNA isolated at least once in substantially pure form and in a quantity or concentration 
enabling identification, manipulation, and recovery of its component nucleotide sequences 
by standard biochemical methods (such as those outlined in Sambrook et al., Molecular 
Cloning: A Laboratory Manual, 2nd sed., Cold Spring Harbor Laboratory, Cold Spring 

20 Harbor, NY (1989)). Such sequences are preferably provided and/or constructed in the form 
of an open reading frame uninterrupted by internal non-translated sequences, or introns, that 
are typically present in eukaryotic genes. Sequences of non-translated DNA can be present 
5 1 or 3' from an open reading frame, where the same do not interfere with manipulation or 
expression of the coding region. 

25 Nucleic acid molecules of the invention include DNA in both single-stranded and 

double-stranded form, as well as the RNA complement thereof. DNA includes, for 
example, cDNA, genomic DNA, chemically synthesized DNA, DNA amplified by PCR, 
and combinations thereof. Genomic DNA may be isolated by conventional techniques, e.g., 
using the cDNA of SEQ ID NOs:l-23, or a suitable fragment thereof, as a probe. 

30 The DNA molecules of the invention include full length genes as well as 

polynucleotides and fragments thereof. The full length gene may include the N-terminal 

signal peptide. Other embodiments include DNA encoding a soluble form, e.g., encoding 

the extracellular domain of the protein, either with or without the signal peptide. 
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The nucleic acids of the invention are preferentially derived from human sources. 




but the invention includes those derived from non-human species, as well. 



Preferred Sequences 

5 The particularly preferred nucleotide sequences of the invention are SEQ ED NOs: l- 

23, as set forth above. cDNA clones having the nucleotide sequence of SEQ ID NOs: I -23 
were isolated as described in Example 1. The sequences of amino acids encoded by the 
DNA of SEQ ID NOs:l-23 are shown in SEQ ID NOs:24-33. 

10 Additional Sequences 

Due to the known degeneracy of the genetic code, wherein more than one codon can 
encode the same amino acid, a DNA sequence can vary from that shown in SEQ ED NOs: 1- 
23, and still encode a polypeptide having the amino acid sequence of SEQ ID NOs:24-33. 
Such variant DNA sequences can result from silent mutations (e.g., occurring during PCR 

15 amplification), or can be the product of deliberate mutagenesis of a native sequence. 

The invention thus provides additional isolated DNA sequences encoding 
polypeptides of the invention, selected from: (a) DNA comprising the nucleotide sequences 
of SEQ ID NOs: 1-23; (b) DNA encoding the polypeptides of SEQ ID NOs:24-33; (c) DNA 
capable of hybridization to a DNA of (a) or (b) under conditions of moderate stringency and 

20 which encodes polypeptides of the invention; (d) DNA capable of hybridization to a DNA 
of (a) or (b) under conditions of high stringency and which encodes polypeptides of the 
invention, and (e) DNA which is degenerate, as a result of the genetic code, to a DNA 
defined in (a), (b), (c), or (d) and which encode polypeptides of the invention. Of course, 
polypeptides encoded by such DNA sequences are encompassed by the invention. 

25 As used herein, conditions of moderate stringency can be readily determined by 

those having ordinary skill in the art based on, for example, the length of the DNA. The 
basic conditions are set forth by Sambrook et al. Molecular Cloning: A Laboratory 
Manual, 2 ed. Vol. 1, pp. 1.101-234, Cold Spring Harbor Laboratory Press, (1989), and 
include use of a prewashing solution for the nitrocellulose filters SX SSC, 0.5% SDS, 1.0 

30 mM EDTA (pH 8.0), hybridization conditions of about 50% formamide, 6X SSC at about 

a 

42 C (or other similar hybridization solution, such as Stark's solution, in about 50% 

formamide at about 42°C), and washing conditions of about 60°C, 0.5X SSC, 0.1% SDS. 

Conditions of high stringency can also be readily determined by the skilled artisan based on, 
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for example, the length^TOie DNA. Generally, such conditions^^defined as hybridization 
conditions as above, and with washing at approximately 68C, 0.2X SSC. 0.1° 0 SDS. The 





skilled artisan will recognize that the temperature and wash solution salt concentration can 
be adjusted as necessary according to factors such as the length of the probe. 



fragments and polypeptides comprising inactivated N-glycosylation site(s), inactivated 
protease processing site(s), or conservative amino acid substitution(s), as described below. 

In another embodiment, the nucleic acid molecules of the invention also comprise 
nucleotide sequences that are at least 80% identical to a native sequence. Also 

10 contemplated are embodiments in which a nucleic acid molecule comprises a sequence that 
is at least 90% identical, at least 95% identical, at least 98% identical, at least 99% identical, 
or at least 99.9% identical to a native sequence. 

The percent identity may be determined by visual inspection and mathematical 
calculation. Alternatively, the percent identity of two nucleic acid sequences can be 

1 5 determined by comparing sequence information using the GAP computer program, version 
6.0 described by Devereux et al. {Nucl. Acids Res. 12:387, 1984) and available from the 
University of Wisconsin Genetics Computer Group (UWGCG). The preferred default 
parameters for the GAP program include: (1) a unary comparison matrix (containing a 
value of 1 for identities and 0 for non-identities) for nucleotides, and the weighted 

20 comparison matrix of Gribskov and Burgess, Nucl. Acids Res. 14:6745, 1986, as described 
by Schwartz and DayhofF, eds., Atlas of Protein Sequence and Structure, National 
Biomedical Research Foundation, pp. 353-358, 1979; (2) a penalty of 3.0 for each gap and 
an additional 0.10 penalty for each symbol in each gap; and (3) no penalty for end gaps. 
Other programs used by one skilled in the art of sequence comparison may also be used. 

25 The invention provides isolated nucleic acids useful in the production of 

polypeptides. Such polypeptides may be prepared by any of a number of conventional 
techniques. A DNA sequence encoding an IMX polypeptide or a desired fragment thereof 
may be subcloned into an expression vector for production of the polypeptide or fragment. 
The DNA sequence advantageously is fused to a sequence encoding a suitable leader or 

30 signal peptide. Alternatively, the desired fragment may be chemically synthesized using 
known techniques. DNA fragments also may be produced by restriction endonuc lease 
digestion of a full length cloned DNA sequence, and isolated by electrophoresis on agarose 
gels. If necessary, oligonucleotides that reconstruct the 5' or 3' terminus to a desired point 
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Also included as an embodiment of the invention is DNA encoding polypeptide 
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may be ligated to a DNA fragment generated by restriction enzyme digestion. Such 
oligonucleotides may additionally contain a restriction endonuclease cleavage site upstream 
of the desired coding sequence, and position an initiation codon (ATG) at the N-terminus of 
the coding sequence. 

5 The well-known polymerase chain reaction (PCR) procedure also may be employed 

to isolate and amplify a DNA sequence encoding a desired protein fragment. 
Oligonucleotides that define the desired termini of the DNA fragment are employed as 5' 
and 3' primers. The oligonucleotides may additionally contain recognition sites for 
restriction endonuc leases, to facilitate insertion of the amplified DNA fragment into an 
10 expression vector. PCR techniques are described in Saiki et al., Science 239:487 (1988); 
Recombinant DNA Methodology, Wu et al., eds.. Academic Press, Inc., San Diego (1989), 
pp. 189-196; and PCR Protocols: A Guide to Methods and Applications, Innis et al., eds., 
Academic Press, Inc. (1990). 



15 POLYPEPTIDES AND FRAGMENTS THEREOF 

The invention encompasses polypeptides and fragments thereof in various forms, 
including those that are naturally occurring or produced through various techniques such as 
procedures involving recombinant DNA technology. Such forms include, but are not 
limited to, derivatives, variants, and oligomers, as well as fusion proteins or fragments 

20 thereof. 



Polypeptides and Fragments Thereof 

The polypeptides of the invention include full length proteins encoded by the nucleic 
acid sequences set forth above. Particularly preferred polypeptides comprise the amino acid 

25 sequences of SEQ ID NOs:24-33. 

The polypeptides of the invention may include an N-terminal hydrophobic region 
that functions as a signal peptide, and may contain an extracellular domain, and may also 
contain a transmembrane region and a C-terminal cytoplasmic domain as well as a spacer 
region. Computer analysis may be used to predict the location of the signal peptide. 

30 For example, the isolated polypeptides of SEQ ID NO: 16 (IMX 28) and SEQ ID 

NO:21 (IMX 44) include an N-terminal hydrophobic region that functions as a signal 

peptide. Computer analysis predicts that the signal peptide corresponds to residues 1 to 1 7 

of SEQ ID NO: 16, and cleavage of the signal peptide of SEQ ID NO: 16 results in a mature 
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protein comprising amino acids 18 to 372. The signal peptide of IMX 44 corresponds to 
residues I to 37 of SEQ ID NO:21. The next most likely computer-predicted signal peptide 
cleavage sites (in descending order) would occur after amino acids 36, 26 and 27 of SEQ CD 
NO:21. Cleavage of the signal peptide at position 37 thus would yield a mature protein 
5 comprising amino acids 38 through 261 of SEQ ED NO:21. 

The skilled artisan will recognize that the above-described boundaries of such 
regions of the polypeptide are approximate. To illustrate, the boundaries of the mature 
protein (which may be predicted by using computer programs available for that purpose) 
may differ from those described above. 

10 The polypeptides of the invention may be membrane bound or they may be secreted 

and, thus, soluble. Soluble polypeptides are capable of being secreted from the cells in 
which they are expressed. In general, soluble polypeptides may be identified (and 
distinguished from non-soluble membrane-bound counterparts) by separating intact cells 
which express the desired polypeptide from the culture medium, e.g., by centrifugation, and 

1 5 assaying the medium (supernatant) for the presence of the desired polypeptide. The 

presence of polypeptide in the medium indicates that the polypeptide was secreted from the 
cells and thus is a soluble form of the protein. 

In one embodiment, the soluble polypeptides and fragments thereof comprise all or 
part of the extracellular domain, but lack the transmembrane region that would cause 

20 retention of the polypeptide on a cell membrane. A soluble polypeptide may include the 

cytoplasmic domain, or a portion thereof, as long as the polypeptide is secreted from the cell 
in which it is produced. 

In general, the use of soluble forms is advantageous for certain applications. 
Purification of the polypeptides from recombinant host cells is facilitated, since the soluble 

25 polypeptides are secreted from the cells. Further, soluble polypeptides are generally more 
suitable for intravenous administration. 

The invention also provides polypeptides and fragments of the extracellular domain 
that retain a desired biological activity. Particular embodiments are directed to polypeptide 
fragments that retain the ability to bind the native cognate, substrate, or counter-structure 

30 ("binding partner"). Such a fragment may be a soluble polypeptide, as described above. In 
another embodiment, the polypeptides and fragments advantageously include regions that 
are conserved in the family as described above. 
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^^are polypeptide fragments comprisin^^l 



Also provided her?m are polypeptide fragments comprisint^Tleast 20, or at least 30. 
contiguous amino acids of the sequence of SEQ ID NOs:24-33. Fragments derived from the 
cytoplasmic domain find use in studies of signal transduction, and in regulating cellular 
processes associated with transduction of biological signals. Polypeptide fragments also 
may be employed as immunogens, in generating antibodies. 



Variants 

Naturally occurring variants as well as derived variants of the polypeptides and 
fragments are provided herein. 

10 Variants may exhibit amino acid sequences that are at least 80% identical. Also 

contemplated are embodiments in which a polypeptide or fragment comprises an amino acid 
sequence that is at least 90% identical, at least 95° b identical, at least 98% identical, at least 
99% identical, or at least 99.9% identical to the preferred polypeptide or fragment thereof. 
Percent identity may be determined by visual inspection and mathematical calculation. 

1 5 Alternatively, the percent identity of two protein sequences can be determined by comparing 
sequence information using the GAP computer program, based on the algorithm of 
Needleman and Wunsch (J. Mol. Bio. 48:443, 1970) and available from the University of 
Wisconsin Genetics Computer Group (UWGCG). The preferred default parameters for the 
GAP program include: (1) a scoring matrix, blosum62, as described by Henikoff and 

20 Henikoff (Proc. Natl. Acad. Sci. USA 89:10915, 1992); (2) a gap weight of 12; (3) a gap 
length weight of 4; and (4) no penalty for end gaps. Other programs used by one skilled in 
the art of sequence comparison may also be used. 

The variants of the invention include, for example, those that result from alternate 
mRNA splicing events or from proteolytic cleavage. Alternate splicing of mRNA may, for 

25 example, yield a truncated but biologically active protein, such as a naturally occurring 
soluble form of the protein. Variations attributable to proteolysis include, for example, 
differences in the N- or C-termini upon expression in different types of host cells, due to 
proteolytic removal of one or more terminal amino acids from the protein (generally from 1- 
5 terminal amino acids). Proteins in which differences in amino acid sequence are 

30 attributable to genetic polymorphism (allelic variation among individuals producing the 
protein) are also contemplated herein. 

Additional variants within the scope of the invention include polypeptides that may 
be modified to create derivatives thereof by forming covalent or aggregative conjugates with 
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other chemical mcieties^uch as glycosyl groups, lipids, phosphate acetyl groups and the 
like. Covalent derivatives may be prepared by linking the chemical moieties to functional 
groups on amino acid side chains or at the N-terminus or C-terminus of a polypeptide. 
Conjugates comprising diagnostic (detectable) or therapeutic agents attached thereto are 
5 contemplated herein, as discussed in more detail below. 

Other derivatives include covalent or aggregative conjugates of the polypeptides 
with other proteins or polypeptides, such as by synthesis in recombinant culture as N- 
terminal or C-terminal fusions. Examples of fusion proteins are discussed below in 
connection with oligomers. Further, fusion proteins can comprise peptides added to 

10 facilitate purification and identification. Such peptides include, for example, poly-His or the 
antigenic identification peptides described in U.S. Patent No. 5,01 1,912 and in Hopp et al., 
Bio/Technology 6: 1204, 1988. One such peptide is the FLAG** peptide, Asp-Tyr-Lys- Asp- 
Asp-Asp- Asp-Lys, which is highly antigenic and provides an epitope reversibly bound by a 
specific monoclonal antibody, enabling rapid assay and facile purification of expressed 

1 5 recombinant protein. A murine hybridoma designated 4E1 1 produces a monoclonal 

antibody that binds the FLAG* peptide in the presence of certain divalent metal cations, as 
described in U.S. Patent 5,01 1,912, hereby incorporated by reference. The 4E1 1 hybridoma 
cell line has been deposited with the American Type Culture Collection under accession no. 
HB 9259. Monoclonal antibodies that bind the FLAG® peptide are available from Eastman 

20 Kodak Co., Scientific Imaging Systems Division, New Haven, Connecticut. 

Among the polypeptides provided herein are variants of native polypeptides that 
retain the native biological activity or the substantial equivalent thereof. One example is a 
variant that binds with essentially the same binding affinity as does the native form. 
Binding affinity can be measured by conventional procedures, e.g., as described in U.S. 

25 Patent No. 5,512,457 and as set forth below. 

Variants include polypeptides that are substantially homologous to the native form, 
but which have an amino acid sequence different from that of the native form because of 
one or more deletions, insertions or substitutions. Particular embodiments include, but are 
not limited to, polypeptides that comprise from one to ten deletions, insertions or 

30 substitutions of amino acid residues, when compared to a native sequence. 

A given amino acid may be replaced, for example, by a residue having similar 

physiochemicai characteristics. Examples of such conservative substitutions include 

substitution of one aliphatic residue for another, such as He, Val, Leu, or Ala for one 
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another; substitutions of one polar residue for another, such as between Lys and Arg, Glu 
and Asp. or Gin and Asn; or substitutions of one aromatic residue for another, such as Phe, 
Trp. or Tyr for one another. Other conserv ative substitutions, e.g., involving substitutions 
of entire regions having similar hydrophobicity characteristics, are well known. 
5 Similarly, the DNAs of the invention include variants that differ from a native DNA 

sequence because of one or more deletions, insertions or substitutions, but that encode a 
biologically active polypeptide. 

The invention further includes polypeptides of the invention with or without 
associated native-pattern glycosylation. Polypeptides expressed in yeast or mammalian 

10 expression systems (e.g., COS-1 or COS-7 cells) can be similar to or significantly different 
from a native polypeptide in molecular weight and glycosylation pattern, depending upon 
the choice of expression system. Expression of polypeptides of the invention in bacterial 
expression systems, such as E. coli, provides non-glycosylated molecules. Further, a given 
preparation may include multiple differentially glycosylated species of the protein. Glycosyl 

15 groups can be removed through conventional methods, in particular those utilizing 

glycopeptidase. In general, glycosylated polypeptides of the invention can be incubated 
with a molar excess of glycopeptidase (Boehringer Mannheim). 

Correspondingly, similar DNA constructs that encode various additions or 
substitutions of amino acid residues or sequences, or deletions of terminal or internal 

20 residues or sequences are encompassed by the invention. For example, N-glycosylation 
sites in the polypeptide extracellular domain can be modified to preclude glycosylation, 
allowing expression of a reduced carbohydrate analog in mammalian and yeast expression 
systems. N-glycosylation sites in eukaryotic polypeptides are characterized by an amino 
acid triplet Asn-X-Y, wherein X is any amino acid except Pro and Y is Ser or Thr. 

25 Appropriate substitutions, additions, or deletions to the nucleotide sequence encoding these 
triplets will result in prevention of attachment of carbohydrate residues at the Asn side 
chain. Alteration of a single nucleotide, chosen so that Asn is replaced by a different amino 
acid, for example, is sufficient to inactivate an N-glycosylation site. Alternatively, the Ser 
or Thr can by replaced with another amino acid, such as Ala. Known procedures for 

30 inactivating N-glycosylation sites in proteins include those described in U.S. Patent 

5,071,972 and EP 276,846, hereby incorporated by reference. 

In another example of variants, sequences encoding Cys residues that are not 

essential for biological activity can be altered to cause the Cys residues to be deleted or 
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replaced with other amm^cids, preventing formation of incorre^mtramolecular disulfide 
bridges upon folding or renaturation. 

Other variants are prepared by modification of adjacent dibasic amino acid residues, 
to enhance expression in yeast systems in which KJEX2 protease activity is present. EP 
212,914 discloses the use of site-specific mutagenesis to inactivate K_EX2 protease 
processing sites in a protein. KEX2 protease processing sites are inactivated by deleting, 
adding or substituting residues to alter Arg-Arg. Arg-Lys, and Lys-Arg pairs to eliminate the 
occurrence of these adjacent basic residues. Lys-Lys pairings are considerably less 
susceptible to KEX2 cleavage, and conversion of Arg-Lys or Lys-Arg to Lys-Lys represents 
a conservative and preferred approach to inactivating KEX2 sites. 



Encompassed by the invention are oligomers or fusion proteins that contain IMX 
polypeptides. Such oligomers may be in the form of covalently-linked or non-covalently- 
linked multimers, including dimers, trimers, or higher oligomers. As noted above, preferred 
polypeptides are soluble and thus these oligomers may comprise soluble polypeptides. In 
one aspect of the invention, the oligomers maintain the binding ability of the polypeptide 
components and provide therefor, bivalent, trivalent, etc., binding sites. 

One embodiment of the invention is directed to oligomers comprising multiple 
polypeptides joined via covalent or non-covalent interactions between peptide moieties 
fused to the polypeptides. Such peptides may be peptide linkers (spacers), or peptides that 
have the property of promoting oligomerizatioa. Leucine zippers and certain polypeptides 
derived from antibodies are among the peptides that can promote oligomerization of the 
polypeptides attached thereto, as described in more detail below. 



polypeptides fused to various portions of antibody-derived polypeptides (including the Fc 



al. (Nature 344:677, 1990); and Hollenbaugh and Aruflfo ("Construction of Immunoglobulin 
Fusion Proteins", in Current Protocols in Immunology, Suppl. 4, pages 10.19.1 - 10.19.1 1, 
1992). 



Oligomers 



Immunoglobulin-based Oligomers 
As one alternative, an oligomer is prepared using polypeptides derived from 
immunoglobulins. Preparation of fusion proteins comprising certain heterologous 



domain) has been described, e.g., by Ashkenazi et al. (PNAS USA 88: 10535, 1991); Bym et 
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One embodiment of the present invention is directed to a dimer comprising two 
fusion proteins created by fusing a polypeptide of the invention to an Fc polypeptide derived 
from an antibody. A gene fusion encoding the polypeptide.Fc fusion protein is inserted into 
an appropriate expression vector. Polypeptide/Fc fusion proteins are expressed in host cells 
5 transformed with the recombinant expression vector, and allowed to assemble much like 
antibody molecules, whereupon interchain disulfide bonds form between the Fc moieties to 
yield divalent molecules. 

The term "Fc polypeptide" as used herein includes native and mutein forms of 
polypeptides comprising from the Fc region of an antibody. Truncated forms of such 

10 polypeptides containing the hinge region that promotes dimerization are also included. 
Preferred polypeptides comprise an Fc polypeptide derived from a human IgGl antibody 
comprising any or all of the CH domains of the Fc region. 

One suitable Fc polypeptide, described in PCT application WO 93/1015 1, hereby 
incorporated by reference, is a single chain polypeptide extending from the N-terminal hinge 

15 region to the native C-terminus of the Fc region of a human IgGl antibody. Another use fill 
Fc polypeptide is the Fc mutein described in U.S. Patent 5,457,035 and in Baum et al., 
(EMBOJ. 13:3992-4001, 1994) incorporated herein by reference. The amino acid sequence 
of this mutein is identical to that of the native Fc sequence presented in WO 93/1015 1, 
except that amino acid 19 has been changed from Leu to Ala, amino acid 20 has been 

20 changed from Leu to Glu, and amino acid 22 has been changed from Gly to Ala. The 
mutein exhibits reduced affinity for Fc receptors. 

The above-described fusion proteins comprising Fc moieties (and oligomers formed 
therefrom) offer the advantage of facile purification by affinity chromatography over Protein 
A or Protein G columns. 

25 In other embodiments, the polypeptides of the invention may be substituted for the 

variable portion of an antibody heavy or light chain. If fusion proteins are made with both 
heavy and light chains of an antibody, it is possible to form an oligomer with as many as 
four soluble regions of the proteins of the invention. 
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de-linker Based Oligomers 
Alternatively, the oligomer is a fusion protein comprising multiple polypeptides, 
with or without peptide linkers (spacer peptides). Among the suitable peptide linkers are 
those described in U.S. Patents 4,751,180 and 4,935.233, which are hereby incorporated by- 
reference. A DNA sequence encoding a desired peptide linker may be inserted between, and 
in the same reading frame as, the DNA sequences of the invention, using any suitable 
conventional technique. For example, a chemically synthesized oligonucleotide encoding 
the linker may be ligated between the sequences. In particular embodiments, a fusion 
protein comprises from two to four soluble IMX polypeptides, separated by peptide linkers. 



Leucine-Zippers 

Another method for preparing the oligomers of the invention involves use of a 
leucine zipper. Leucine zipper domains are peptides that promote oligomerization of the 
proteins in which they are found. Leucine zippers were originally identified in several 

15 DNA-binding proteins (Landschulz et al., Science 240:1759, 1988), and have since been 
found in a variety of different proteins. Among the known leucine zippers are naturally 
occurring peptides and derivatives thereof that dimerize or trimerize. 

Examples of leucine zipper domains suitable for producing soluble oligomeric 
proteins are described in PCT application WO 94/10308, and the leucine zipper derived 

20 from lung surfactant protein D (SPD) described in Hoppe et al. (FEBS Letters 344:191, 

1 994), hereby incorporated by reference. The use of a modified leucine zipper that allows 
for stable trimerization of a heterologous protein fused thereto is described in Fanslow et al. 
{Semin. Immunol. 6:267-278, 1994). Recombinant fusion proteins comprising a soluble 
polypeptide fused to a leucine zipper peptide are expressed in suitable host cells, and the 

25 soluble oligomer that forms is recovered from the culture supernatant. 

In particular embodiments, leucine residues in a leucine zipper moiety are replaced 
by isoleucine residues. Such peptides comprising isoleucine may be referred to as 
"isoleucine zippers" but are encompassed by the term leucine zippers" as employed herein. 



30 PRODUCTION OF POLYPEPTIDES AND FRAGMENTS THEREOF 

Expression, isolation and purification of the polypeptides and fragments of the 
invention may be accomplished by any suitable technique, including but not limited to the 
following: 
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Expression Systems 

The present invention also provides recombinant cloning and expression vectors 
containing DNA, as well as host cell containing the recombinant vectors. Expression 
5 vectors comprising DNA may be used to prepare the polypeptides or fragments of the 

invention encoded by the DNA. A method for producing polypeptides comprises culturing 
host cells transformed with a recombinant expression vector encoding the polypeptide, 
under conditions that promote expression of the polypeptide, then recovering the expressed 
polypeptides from the culture. The skilled artisan will recognize that the procedure for 

10 purifying the expressed polypeptides will vary according to such factors as the type of host 
cells employed, and whether the polypeptide is membrane-bound or a soluble form that is 
secreted from the host cell. 

Any suitable expression system may be employed. The vectors include a DNA 
encoding a polypeptide or fragment of the invention, operably linked to suitable 

15 transcriptional or translational regulatory nucleotide sequences, such as those derived from a 
mammalian, microbial, viral, or insect gene. Examples of regulatory sequences include 
transcriptional promoters, operators, or enhancers, an mRNA ribosomal binding site, and 
appropriate sequences which control transcription and translation initiation and termination. 
Nucleotide sequences are operably linked when the regulatory sequence functionally relates 

20 to the DNA sequence. Thus, a promoter nucleotide sequence is operably linked to a DNA 
sequence if the promoter nucleotide sequence controls the transcription of the DNA 
sequence. An origin of replication that confers the ability to replicate in the desired host 
cells, and a selection gene by which trans formants are identified, are generally incorporated 
into the expression vector. 

25 In addition, a sequence encoding an appropriate signal peptide (native or 

heterologous) can be incorporated into expression vectors. A DNA sequence for a signal 
peptide (secretory leader) may be fused in frame to the nucleic acid sequence of the 
invention so that the DNA is initially transcribed, and the mRNA translated, into a fusion 
protein comprising the signal peptide. A signal peptide that is functional in the intended 

30 host cells promotes extracellular secretion of the polypeptide. The signal peptide is cleaved 

from the polypeptide upon secretion of polypeptide from the cell. 

The skiiled artisan will also recognize that the position(s) at which the signal peptide 

is cleaved may differ from that predicted by computer program, and may vary according to 
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such factors as the t\pe ofnost cells employed in expressing a recombinant polypeptide. A 
protein preparation may include a mixture of protein molecules having different N-terminal 
amino acids, resulting from cleavage of the signal peptide at more than one site. 

Suitable host cells for expression of polypeptides include prokaryotes, yeast or 
5 higher eukaryotic cells. Mammalian or insect cells are generally preferred for use as host 
cells. Appropriate cloning and expression vectors for use with bacterial, fungal, yeast, and 
mammalian cellular hosts are described, for example, in Pouwels et al. Cloning Vectors: A 
Laboratory Manual, Elsevier, New York, (1985). Cell-free translation systems could also 
be employed to produce polypeptides using RNAs derived from DNA constructs disclosed 



Prokaryotes include gram-negative or gram-positive organisms. Suitable prokaryotic 
host cells for transformation include, for example, E. coli, Bacillus subtilis, Salmonella 
15 typhimurium, and various other species within the genera Pseudomonas, Strep tomyces, and 
Staphylococcus. In a prokaryotic host cell, such as E. coli* a polypeptide may include an N- 
terminal methionine residue to facilitate expression of the recombinant polypeptide in the 
prokaryotic host cell. The N-terminal Met may be cleaved from the expressed recombinant 
polypeptide. 

20 Expression vectors for use in prokaryotic host cells generally comprise one or more 

phenotypic selectable marker genes. A phenotypic selectable marker gene is, for example, a 
gene encoding a protein that confers antibiotic resistance or that supplies an autotrophic 
requirement. Examples of useful expression vectors for prokaryotic host cells include those 
derived from commercially available plasmids such as the cloning vector pBR322 (ATCC 

25 3701 7). pBR322 contains genes for ampicillin and tetracycline resistance and thus provides 
simple means for identifying transformed cells. An appropriate promoter and a DNA 
sequence are inserted into the pBR322 vector. Other commercially available vectors 
include, for example, pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and 
pGEMl (Promega Biotec, Madison, WI, USA). 

30 Promoter sequences commonly used for recombinant prokaryotic host cell 



expression vectors include 3 -lactamase (penicillinase), lactose promoter system (Chang et 
al., Nature 275:615, 1978; and Goeddel et al., Nature 281:544, 1979), tryptophan (trp) 
promoter system (Goeddel et al., Nucl. Acids Res. 5:4057, 1980; and EP-A-36776) and tac 



10 



herein. 



Prokaryotic Svstems 
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ular Cloning: A Laboratory- Manual. 




Spring Harbor 
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Laboratory, p. 412, 1982). A particularly useful prokaryotic host cell expression system 
employs a phage 8Pl promoter and a cI857ts thermolabile repressor sequence. Plasmid 
vectors available from the American Type Culture Collection which incorporate derivatives 
of the 8P L promoter include plasmid pHUB2 (resident in £. coli strain JMB9, ATCC 37092) 
and pPLc28 (resident in E. coli RR1, ATCC 53082). 



Alternatively, the polypeptides may be expressed in yeast host cells, preferably from 
the Saccharomyces genus (e.g., S. cerevisiae). Other genera of yeast, such as Pichia or 
Kluyveromyces, may also be employed. Yeast vectors will often contain an origin of 
replication sequence from a 2\x yeast plasmid, an autonomously replicating sequence (ARS), 
a promoter region, sequences for polyadenylation, sequences for transcription termination, 
and a selectable marker gene. Suitable promoter sequences for yeast vectors include, among 
others, promoters for metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. 
Chem. 255:2073, 1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 7:149, 
1968; and Holland et al., Biochem. 77:4900, 1978), such as enolase, glyceraldehyde-3- 
phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, 
glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, 
triosephosphate isomerase, phospho-glucose isomerase, and glucokinase. Other suitable 
vectors and promoters for use in yeast expression are further described in Hitzeman, EPA- 
73,657. Another alternative is the glucose-repressible ADH2 promoter described by Russell 
et al. (J. Biol. Chem. 258:261 A, 1982) and Beier et al. (Nature 300:724, 1982). Shuttle 
vectors replicable in both yeast and E. coli may be constructed by inserting DNA sequences 
from pBR322 for selection and replication in E. coli (Amp r gene and origin of replication) 
into the above-described yeast vectors. 

The yeast V-factor leader sequence may be employed to direct secretion of the 
polypeptide. The V-rfactor leader sequence is often inserted between the promoter sequence 
and the structural gene sequence. See, e.g., Kurjan et al., Cell 50:933, 1982 and Bitter et al., 
Proc. Natl. Acad. Sci. USA <S/:5330, 1984. Other leader sequences suitable for facilitating 
secretion of recombinant polypeptides from yeast hosts are known to those of skill in the art. 
A leader sequence may be modified near its 3* end to contain one or more restriction sites. 
This will facilitate fusion of the leader sequence to the structural gene. 



Yeast Systems 



# • 

Yeast trans formaWon protocols are known to those of skilHn the art. One such 
protocol is described by Hinnen et a!., Proc. Natl. Acad. Sci. USA 75:1929, 1978. The 
Hinnen et al. protocol selects for Trp~ trans formants in a selective medium, wherein the 
selective medium consists of 0.67° o yeast nitrogen base, 0.5% casamino acids, 2% glucose, 
10 mg/ml adenine and 20 mg/ml uracil. 

Yeast host cells transformed by vectors containing an ADH2 promoter sequence may 
be grown for inducing expression in a "rich" medium. An example of a rich medium is one 
consisting of 1% yeast extract, 2% peptone, and 1% glucose supplemented with 80 mg/ml 
adenine and 80 mg/ml uracil. Derepression of the ADH2 promoter occurs when glucose is 
exhausted from the medium. 



Mammalian or Insect Systems 

Mammalian or insect host cell culture systems also may be employed to express 
recombinant polypeptides. Bacculovirus systems for production of heterologous proteins in 
insect cells are reviewed by Luckow and Summers, Bio/Technology 6:47 (1988). 
Established cell lines of mammalian origin also may be employed. Examples of suitable 
mammalian host cell lines include the COS-7 line of monkey kidney cells (ATCC CRL 
1651) (Gluzman et al., Cell 23:175, 1981), L cells, CI 27 cells, 3T3 cells (ATCC CCL 163), 
Chinese hamster ovary (CHO) cells, HeLa cells, and BHK (ATCC CRL 10) cell lines, and 
the CV1/EBNA cell line derived from the African green monkey kidney cell line CV1 
(ATCC CCL 70) as described by McMahan et al. (EMBOJ. 10: 2821, 1991). 

Established methods for introducing DNA into mammalian cells have been 

described (Kaufman, R.J., Large Scale Mammalian Cell Culture, 1990, pp. 15-69). 

Additional protocols using commercially available reagents, such as Lipofectamine lipid 

reagent (Gibco/BRL) or Lipofectamine-Plus lipid reagent, can be used to trans feet cells 

(Feigner et al., Proc. Natl. Acad. ScL USA 84: 7413-7417, 1987). In addition, 

electroporation can be used to transfect mammalian cells using conventional procedures, 

such as those in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2 ed. Vol. 1-3, 

Cold Spring Harbor Laboratory Press, 1989). Selection of stable transformants can be 

performed using methods known in the art, such as, for example, resistance to cytotoxic 

drugs. Kaufman et al., Meth. in Enzymology 755:487-5 1 1, 1990, describes several selection 

schemes, such as dihydrofolate reductase (DHFR) resistance. A suitable host strain for 

DHFR selection can be CHO strain DX-B 1 1 , which is deficient in DHFR (Urlaub and 

39 




Chasin, Proc. NatL AcWFSci. CSA ""4216-4220, 1980). A pSPrid expressing the DHFR 
cDNA can be introduced into strain DX-B1 1, and only cells that contain the plasmid can 
grow in the appropriate selective media. Other examples of selectable markers that can be 
incorporated into an expression vector include cDNAs conferring resistance to antibiotics, 
5 such as G4 1 8 and hygromycin B. Cells harboring the vector can be selected on the basis of 
resistance to these compounds. 

Transcriptional and translational control sequences for mammalian host cell 
expression vectors can be excised from viral genomes. Commonly used promoter 
sequences and enhancer sequences are derived from polyoma virus, adenovirus 2, simian 
10 virus 40 (SV40), and human c\tomegalovirus. DNA sequences derived from the SV40 viral 
genome, for example, SV40 origin, early and late promoter, enhancer, splice, and 
polyadenylation sites can be used to provide other genetic elements for expression of a 
structural gene sequence in a mammalian host cell. Viral early and late promoters are 
particularly useful because both are easily obtained from a viral genome as a fragment, 
15 which can also contain a viral origin of replication (Fiers et al., Nature 273:1 13, 1978; 

Kaufman, Meth, in Enzymology, 1990). Smaller or larger SV40 fragments can also be used, 
provided the approximately 250 bp sequence extending from the Hind m site toward the Bgl 
I site located in the SV40 viral origin of replication site is included. 

Additional control sequences shown to improve expression of heterologous genes 
20 from mammalian expression vectors include such elements as the expression augmenting 
sequence element (EASE) derived from CHO cells (Morris et al., Animal Cell Technology, 
1997, pp. 529-534 and PCT Application WO 97/25420) and the tripartite leader (TPL) and 
VA gene RNAs from Adenovirus 2 (Gingeras et al., J. Biol. Chem. 257:13475-13491, 
1982). The internal ribosome entry site (IRES) sequences of viral origin allows dicistronic 
25 mRNAs to be translated efficiently (Oh and Sarnow, Current Opinion in Genetics and 

Development J:295-300, 1993; Ramesh et al., Nucleic Acids Research 24:2697-2700, 1996). 
Expression of a heterologous cDNA as part of a dicistronic mRNA followed by the gene for 
a selectable marker (e.g. DHFR) has been shown to improve trans fectability of the host and 
expression of the heterologous cDNA (Kaufman, Meth. in Enzymology, 1990). Exemplary 
30 expression vectors that employ dicistronic mRNAs are pTR-DC/GFP described by Mosser 
et al., Biotechniques 22:150-161, 1997, and p2A5I described by Morris et al., Animal Cell 
Technology, 1997. pp. 529-534. 
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s^Pression vector, pCAVNOT, has been delPft 



A useful high e^ffession vector, pCAVNOT, has been delPibed by Vosley et al., 
Cell 59:335-348, 1989. Other expression vectors for use in mammalian hosi cells can be 
constructed as disclosed by Okayama and Berg (Afol. Cell. Biol. i:280, 19S3 ». A useful 
system for stable high level expression of mammalian cDNAs in C 127 murine mammary 
epithelial cells can be constructed substantially as described by Cosman et al. «.l/o/. 
Immunol. 23:935, 1986). A useful high expression vector, PMLSV N1/N4, described by 
Cosman et al, Nature 312:16%, 1984, has been deposited as ATCC 39890. Additional 
useful mammalian expression vectors are described in EP-A-0367566, and in WO 
91 18982, incorporated by reference herein. In yet another alternative, the vectors can be 
derived from retroviruses. 

Another useful expression vector, pFLAG*. can be used. FLAG 8 technology is 
centered on the fusion of a low molecular weight (lkD), hydrophilic, FLAG* marker 
peptide to the N-terminus of a recombinant protein expressed by pFLAG 8 expression 
vectors. 

Regarding signal peptides that may be employed, the native signal pepdde may be 
replaced by a heterologous signal peptide or leader sequence, if desired. The choice of 
signal peptide or leader may depend on factors such as the type of host cells in which the 
recombinant polypeptide is to be produced. To illustrate, examples of heterologous signal 
peptides that are functional in mammalian host cells include the signal sequence for 
interleukin-7 (IL-7) described in United States Patent 4,965,195, the signal sequence for 
interleukin-2 receptor described in Cosman et al., Nature 312:768 (1984); the interleukin-4 
receptor signal peptide described in EP 367,566; the type I interleukin- 1 receptor signal 
peptide described in U.S. Patent 4,968,607; and the type II interleukin- 1 receptor signal 
peptide described in EP 460,846. 

Purification 

The invention also includes methods of isolating and purifying the polypeptides and 
fragments thereof. 
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Isolation and Purification 
The' 'isolated' 1 polypeptides or fragments thereof encompassed by this invention are 
pohpeptides or fragments that are not in an environment identical to an environment in 
which it or they can be found in nature. The "purified' 1 polypeptides or fragments thereof 
encompassed by this invention are essentially free of association with other proteins or 
polypeptides, for example, as a purification product of recombinant expression systems such 
as those described above or as a purified product from a non-recombinant source such as 
naturally occurring cells and/or tissues. 

In one preferred embodiment, the purification of recombinant polypeptides or 
fragments can be accomplished using fusions of polypeptides or fragments of the invention 
to another polypeptide to aid in the purification of polypeptides or fragments of the 
invention. Such fusion partners can include the poly-His or other antigenic identification 
peptides described above as well as the Fc moieties described previously. 

With respect to any type of host cell, as is known to the skilled artisan, procedures 
for purifying a recombinant polypeptide or fragment will vary according to such factors as 
the type of host cells employed and whether or not the recombinant polypeptide or fragment 
is secreted into the culture medium. 

In general, the recombinant polypeptide or fragment can be isolated from the host 
cells if not secreted, or from the medium or supernatant if soluble and secreted, followed by 
one or more concentration, salting-out, ion exchange, hydrophobic interaction, affinity 
purification or size exclusion chromatography steps. As to specific ways to accomplish 
these steps, the culture medium first can be concentrated using a commercially available 
protein concentration filter, for example, an Amicon or Millipore Pellicon ultrafiltration 
unit. Following the concentration step, the concentrate can be applied to a purification 
matrix such as a gel filtration medium. Alternatively, an anion exchange resin can be 
employed, for example, a matrix or substrate having pendant diethylaminoethyl (DEAE) 
groups. The matrices can be acrylamide, agarose, dextran, cellulose or other types 
commonly employed in protein purification. Alternatively, a cation exchange step can be 
employed. Suitable cation exchangers include various insoluble matrices comprising 
sulfopropyl or carboxymethyl groups. In addition, a chromatofocusing step can be 
employed. Alternatively, a hydrophobic interaction chromatography step can be employed. 
Suitable matrices can be phenyl or octyl moieties bound to resins. In addition, affinity 

chromatography with a matrix which selectively binds the recombinant protein can be 
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f^fth resins emploved are lectin columns^ve 



employed. Examples ofTffch resins employed are lectin columns^ye columns, and metal- 
chelaiing columns. Finally, one or more reverse-phase high performance liquid 
chromatography (RP-HPLC) steps emploxing hydrophobic RP-HPLC media, (e.g.. silica gel 
or polymer resin having pendant methyl, octyl, octyldecyl or other aliphatic groups) can be 
5 employed to further purify the polypeptides. Some or all of the foregoing purification steps, 
in various combinations, are well known and can be employed to provide an isolated and 
purified recombinant protein. 

It is also possible to utilize an affinity column comprising a polypeptide-binding 
protein of the invention, such as a monoclonal antibody generated against polypeptides of 

10 the invention, to affinity-purify expressed polypeptides. These polypeptides can be removed 
from an affinity column using conventional techniques, e.g., in a high salt elution buffer and 
then dialyzed into a lower salt buffer for use or by changing pH or other components 
depending on the affinity matrix utilized, or be competitively removed using the naturally 
occurring substrate of the affinity moiety, such as a polypeptide derived from the invention. 

15 In this aspect of the invention, polypeptide-binding proteins, such as the anti- 

polypeptide antibodies of the invention or other proteins that may interact with the 
polypeptide of the invention, can be bound to a solid phase support such as a column 
chromatography matrix or a similar substrate suitable for identifying, separating, or 
purifying cells that express polypeptides of the invention on their surface. Adherence of 

20 polypeptide-binding proteins of the invention to a solid phase contacting surface can be 

accomplished by any means. For example, magnetic microspheres can be coated with these 
polypeptide-binding proteins and held in the incubation vessel through a magnetic field. 
Suspensions of cell mixtures are contacted with the solid phase that has such polypeptide- 
binding proteins thereon. Cells having polypeptides of the invention on their surface bind to 

25 the fixed polypeptide-binding protein and unbound cells then are washed away. This 

affinity-binding method is useful for purifying, screening, or separating such polypeptide- 
expressing cells from solution. Methods of releasing positively selected cells from the solid 
phase are known in the art and encompass, for example, the use of enzymes. Such enzymes 
are preferably non-toxic and non-injurious to the cells and are preferably directed to 

30 cleaving the cell-surface binding partner. 

Alternatively, mixtures of cells suspected of containing polypeptide-expressing cells 

of the invention first can be incubated with a biotinylated polypeptide-binding protein of the 

invention. Incubation periods are typically at least one hour in duration to ensure sufficient 
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binding to polypeptides of the invention. The resulting mixture then is passed through a 
column packed with avidin-coated beads, whereby the high affinity of biotin for avidin 
provides the binding of the polypeptide-binding cells to the beads. Use of avidin-coated 
beads is known in the art. See Berenson, et ah J. Cell. Biochem. % 10D:239 (1986). Wash of 
5 unbound material and the release of the bound cells is performed using conventional 
methods. 

The desired degree of purity depends on the intended use of the protein. A relatively 
high degree of purity is desired when the polypeptide is to be administered in vivo, for 
example. In such a case, the polypeptides are purified such that no protein bands 

10 corresponding to other proteins are detectable upon analysis by SDS-polyacrylamide gel 

electrophoresis (SDS-PAGE). It will be recognized by one skilled in the pertinent field that 
multiple bands corresponding to the polypeptide may be visualized by SDS-PAGE, due to 
differential glycosylation, differential post-trans lational processing, and the like. Most 
preferably, the polypeptide of the invention is purified to substantial homogeneity, as 

15 indicated by a single protein band upon analysis by SDS-PAGE. The protein band may be 
visualized by silver staining, Coomassie blue staining, or (if the protein is radiolabeled) by 
autoradiography. 

Assays 

20 The purified polypeptides of the invention (including proteins, polypeptides, 

fragments, variants, oligomers, and other forms) may be tested for the ability to bind a 
cognate, ligand, receptor, substrate, or counter-structure and the like ("binding partner") in 
any suitable assay, such as a conventional binding assay. To illustrate, the polypeptide may 
be labeled with a detectable reagent (e.g., a radionuclide, chromophore, enzyme that 

25 catalyzes a colorimetric or fluorometric reaction, and the like). The labeled polypeptide is 
contacted with cells expressing the binding partner. The cells then are washed to remove 
unbound labeled polypeptide, and the presence of cell-bound label is determined by a 
suitable technique, chosen according to the nature of the label. 

One example of a binding assay procedure is as follows. A recombinant expression 

30 vector containing the binding partner cDNA is constructed using methods known in the art. 

CV1-EBNA-1 cells in 10 cm 2 dishes are transfected with the recombinant expression vector. 

CV- 1 EBNA-1 cells (ATCC CRL 10478) constitutively express EBV nuclear antigen- 1 

driven from the CMV immediate-early enhancer/promoter. CV1 -EBNA-1 was derived from 
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the African Green Monl^pridney cell line CV-1 (ATCC CCL l^mm described by 
McMahan et al. (EMBOJ. 10:2821. 1991). 

The transfected cells are cultured for 24 hours, and the cells in each dish then are 

c 

split into a 24-well plate. After culturing an additional 48 hours, the transfected cells (about 
5 4 x 10 4 cells well) are washed with BM-NFDM. which is binding medium (RPMI 1640 
containing 25 mg/ml bovine serum albumin. 2 mg'ml sodium azide, 20 mM Hepes pH 7.2) 
to which 50 mg/ml nonfat dry milk has been added. The cells then are incubated for I hour 
at 37°C with various concentrations of, for example, a soluble polypeptide/Fc fusion protein 
made as set forth above. Cells then are washed and incubated with a constant saturating 

10 concentration of a l '"I-mouse anti-human IgG in binding medium, with gentle agitation for 1 
hour at 37°C. After extensive washing, cells are released via trypsinization. 

The mouse anti-human IgG employed above is directed against the Fc region of 
human IgG and can be obtained from Jackson Immunoresearch Laboratories, Inc., West 
Grove, PA. The antibody is radioiodinated using the standard chloramine-T method. The 

1 5 antibody will bind to the Fc portion of any polypeptide/Fc protein that has bound to the 

cells. In all assays, non-specific binding of 125 I-antibody is assayed in the absence of the Fc 
fusion protein/Fc, as well as in the presence of the Fc fusion protein and a 200-fold molar 
excess of unlabeled mouse anti-human IgG antibody. 

Cell-bound ! ~ 5 I-antibody is quantified on a Packard Autogamma counter. Affinity 

20 calculations (Scatchard, Ann. M Y. Acad. Sci. 5 1 :660, 1949) are generated on RS/1 (BBN 
Software, Boston, MA) run on a Microvax computer. 

Another type of suitable binding assay is a competitive binding assay. To illustrate, 
biological activity of a variant may be determined by assaying for the variant's ability to 
compete with the native protein for binding to the binding partner. 

25 Competitive binding assays can be performed by conventional methodology. 

Reagents that may be employed in competitive binding assays include radiolabeled EMX 
polypeptides and intact cells expressing the EMX polypeptide (endogenous or recombinant) 
on the cell surface. For example, a radiolabeled soluble IMX polypeptide fragment can be 
used to compete with a soluble IMX polypeptide variant for binding to the cell surface 

30 binding partner. Instead of intact cells, one could substitute a soluble binding partner/Fc 

fusion protein bound to a solid phase through the interaction of Protein A or Protein G (on 

the solid phase) with the Fc moiety. Chromatography columns that contain Protein A and 

Protein G include those available from Pharmacia Biotech, Inc., Piscataway, NJ. 
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Another type orWmpetitive binding assay utilizes the radrmabeled soluble binding 
partner, such as a soluble binding partner Tc fusion protein, and intact cells expressing the 
IMX polypeptide. Qualitative results can be obtained by competitive autoradiographic plate 
binding assays, while Scatchard plots (Scatchard, Ann. \ r .Y. Acad. Sci. 51:660, 1949) may 
be utilized to generate quantitative results. 



USE OF IMX NUCLEIC ACIDS OR OLIGONUCLEOTIDES 

In addition to being used to express polypeptides as described above, the nucleic 
acids of the invention, including DNA, and oligonucleotides thereof can be used: 
3 as probes to identify nucleic acid encoding proteins homologous to 

IMX polypeptides; 
3 to identify human chromosomes; 

3 to map genes on human chromosome numbers 7, 19, and 22; 

3 to identify genes associated with certain diseases, syndromes, or other 

conditions associated with human chromosome numbers 7, 19, and 

22; 

3 as single- stranded sense or antisense oligonucleotides, to inhibit expression 

of polypeptide encoded by the IMX sequences; 
3 to help detect defective genes in an individual; and 
3 for gene therapy. 

Probes 

Among the uses of nucleic acids of the invention is the use of fragments as probes or 
primers. Such fragments generally comprise at least about 17 contiguous nucleotides of a 
DNA sequence. In other embodiments, a DNA fragment comprises at least 30, or at least 
60, contiguous nucleotides of a DNA sequence. 

Because homologs of SEQ ID NOs:l-23, from other mammalian species, are 
contemplated herein, probes based on the human DNA sequence of SEQ ID NOs: 1-23 may 
be used to screen cDNA libraries derived from other mammalian species, using 
conventional cross-species hybridization techniques. 

Using knowledge of the genetic code in combination with the amino acid sequences 
set forth above, sets of degenerate oligonucleotides can be prepared. Such oligonucleotides 
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are useful as primers, e^^m polymerase chain reactions (PCR)/wlreby DNA fragments 
are isolated and amplified. 



Identifying Chromosome Number 
5 All or a portion of the nucleic acids of SEQ ED NOs:l-23, including 

oligonucleotides, can be used by those skilled in the art using well-known techniques to 
identify the human chromosomes, and the specific locus thereof, that contain the DNA of 
IMX family members. Useful techniques include, but are not limited to, using the sequence 
or portions, including oligonucleotides, as a probe in various well-known techniques such as 

10 in situ hybridization to chromosome spreads, Southern blot hybridization to hybrid cell 
lines, fluorescent tagging, and radiation hybrid mapping. 

For example, chromosomes can be mapped by radiation hybridization. 
PCR is performed using the Whitehead Institute/MIT Center for Genome Research 
Genebridge4 panel of 93 radiation hybrids (http://www-genome.wi.mit.edu/ftp/distribution/* 

15 human - STS_releases/july97/rhmap/genebridge4.html). Primers are used which lie within a 
putative exon of the gene of interest and which amplify a product from human genomic 
DNA, but do not amplify hamster genomic DNA. The results of the PCRs are converted 
into a data vector that is submitted to the Whitehead/MTT Radiation Mapping site on the 
internet (http://w\w-seq.wi. mit.edu). The data is scored and the chromosomal assignment 

20 and placement relative to known Sequence Tag Site (STS) markers on the radiation hybrid 
map is provided. The following web site provides additional information about radiation 
hybrid mapping: 

http://www-genome.wi.mit.edu/ftp/distribution/human_STS_releases/july97/ 
07-97.INTRO.html). 

25 

Identifying Associated Diseases 

As described "previously, IMX molecules numbered 4, 21, 44, and 56 have been 

mapped to particular chromosome locations. Thus, the nucleic acid of a particular IMX 

molecule or a fragment thereof can be used by one skilled in the art using well-known 

30 techniques to analyze abnormalities associated with gene mapping to such chromosomes. 

This enables one to distinguish conditions in which this marker is rearranged or deleted. In 

addition, nucleotides of such IMX molecules cr fragments thereof can be used as a 

positional marker to map other genes of previously unknown location. 
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The DNA may b^Tsed in developing treatments for any disorder mediated (directly 
or indirectly) by defective, or insufficient amounts of, the genes corresponding to the nucleic 
acids of the invention. Disclosure herein of native nucleotide sequences permits the 
detection of defective genes, and the replacement thereof with normal genes. Defective 
genes may be detected in in vitro diagnostic assays, and by comparison of a native 
nucleotide sequence disclosed herein with that of a gene derived from a person suspected of 
harboring a defect in this gene. 

Sense- Antisense 

Other useful fragments of the nucleic acids include antisense or sense 
oligonucleotides comprising a single-stranded nucleic acid sequence (either RNA or DNA) 
capable of binding to target mRNA (sense) or DNA (antisense) sequences. Antisense or 
sense oligonucleotides according to the present invention comprise a fragment of DNA 
(SEQ ID NOs: 1-23). Such a fragment generally comprises at least about 14 nucleotides, 
preferably from about 14 to about 30 nucleotides. The ability to derive an antisense or a 
sense oligonucleotide, based upon a cDNA sequence encoding a given protein is described 
in, for example, Stein and Cohen {Cancer Res. 48:2659, 1988) and van der Krol et al. 
{BioTechniques 6:958, 1988). 

Binding of antisense or sense oligonucleotides to target nucleic acid sequences 
results in the formation of duplexes that block or inhibit protein expression by one of 
several means, including enhanced degradation of the mRNA by RNAseH, inhibition of 
splicing, premature termination of transcription or translation, or by other means. The 
antisense oligonucleotides thus may be used to block expression of proteins. Antisense or 
sense oligonucleotides further comprise oligonucleotides having modified sugar- 
phosphcdiester backbones (or other sugar linkages, such as those described in 
W09 1/06629) and wherein such sugar linkages are resistant to endogenous nucleases. Such 
oligonucleotides with resistant sugar linkages are stable in vivo (i.e., capable of resisting 
enzymatic degradation) but retain sequence specificity to be able to bind to target nucleotide 
sequences. 

Other examples of sense or antisense oligonucleotides include those . 

oligonucleotides which are covalently linked to organic moieties, such as those described in 

WO 90/10448, and other moieties that increases affinity of the oligonucleotide for a target 

nucleic acid sequence, such as poly-(L-lysine). Further still, intercalating agents, such as 
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eilipticine, and alkylating"agents or metal complexes may be attached to sense or antisense 
oligonucleotides to modify binding specificities of the antisense or sense oligonucleotide for 
the target nucleotide sequence. 

Antisense or sense oligonucleotides may be introduced into a cell containing the 
target nucleic acid sequence by any gene transfer method, including, for example, 
lipofection, CaPOa-mediated DNA transfection, electroporation, or by using gene transfer 
vectors such as Epstein-Barr virus. 

Sense or antisense oligonucleotides also may be introduced into a cell containing the 
target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as 
described in WO 91/04753. Suitable ligand binding molecules include, but are not limited 
to, cell surface receptors, growth factors, other cytokines, or other ligands that bind to cell 
surface receptors. Preferably, conjugation of the ligand binding molecule does not 
substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. 

Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell 
containing the target nucleic acid sequence by formation of an oligonucleotide-lipid 
complex, as described in WO 90/10448. The sense or antisense oligonucleotide-lipid 
complex is preferably dissociated within the cell by an endogenous lipase. 

USE OF IMX POLYPEPTIDES AND FRAGMENTED POLYPEPTIDES 

Uses include, but are not limited to, the following: 

Purifying proteins and measuring activity thereof 
Delivery Agents 

Therapeutic and Research Reagents 
Molecular weight and Isoelectric focusing markers 
Controls for peptide fragmentation 
Identification of unknown proteins 
Preparation of Antibodies 
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Purification Reaggms 

Each of the poKpeptides of the invention finds use as a protein purification reagent. 
The polypeptides may be attached to a solid support material and used to purify (binding 
partner) proteins by affinity chromatography. In particular embodiments, a polypeptide (in 
any form described herein that is capable of binding (binding partner)) is attached to a solid 
support by conventional procedures. As one example, chromatography columns containing 
functional groups that will react with functional groups on amino acid side chains of 
proteins are available (Pharmacia Biotech^ Inc., Piscataway, NJ). In an alternative, a 
polypeptide/Pc protein (as discussed above) is attached to Protein A- or Protein G- 
containing chromatography columns through interaction with the Fc moiety. 

The polypeptide also finds use in purifying or identifying cells that express (binding 
partner) on the cell surface. Polypeptides are bound to a solid phase such as a column 
chromatography matrix or a similar suitable substrate. For example, magnetic microspheres 
can be coated with the polypeptides and held in an incubation vessel through a magnetic 
field. Suspensions of cell mixtures containing (binding partner) expressing cells are 
contacted with the solid phase having the polypeptides thereon. Cells expressing (binding 
partner) on the cell surface bind to the fixed polypeptides, and unbound cells then are 
washed away. 

Alternatively, the polypeptides can be conjugated to a detectable moiety, then 
incubated with cells to be tested for (binding partner) expression. After incubation, 
unbound labeled matter is removed and the presence or absence of the detectable moiety on 
the cells is determined. 

In a further alternative, mixtures of cells suspected of containing (binding partner) 
cells are incubated with biotinylated polypeptides. Incubation periods are typically at least 
one hour in duration to ensure sufficient binding. The resulting mixture then is passed 
through a column packed with avidin-coated beads, whereby the high affinity of biotin for 
avidin provides binding of the desired cells to the beads. Procedures for using avidin-coated 
beads are known (see Berenson, et al. J. Cell. Biochem. y 10D:239, 1986). Washing to 
remove unbound material, and the release of the bound cells, are performed using 
conventional methods. 

Measuring Activity 
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Polypeptides als<Tnnd use in measuring the biological activity of (binding partner) 
protein in terms of their binding affinity. The polypeptides thus may be employed by those 
conducting "quality assurance" studies, e.g., to monitor shelf life and stability of protein 
under different conditions. For example, the polypeptides may be employed in a binding 
affinity study to measure the biological activity of a (binding partner) protein that has been 
stored at different temperatures, or produced in different cell types. The proteins also may 
be used to determine whether biological activity is retained after modification of a (binding 
partner) protein (e.g., chemical modification, truncation, mutation, etc.). The binding 
affinity of the modified (binding partner) protein is compared to that of an unmodified 
(binding partner) protein to detect any adverse impact of the modifications on biological 
activity^ of (binding partner). The biological activity of a (binding partner) protein thus can 
be ascertained before it is used in a research study, for example. 



Delivery Agents 

The polypeptides also find use as carriers for delivering agents attached thereto to 
cells bearing binding partner. Cells expressing (binding partner) include those identified in 
(add citation if reference known). The polypeptides thus can be used to deliver diagnostic 
or therapeutic agents to such cells (or to other cell types found to express (binding partner) 
on the cell surface) in in viiro or in vivo procedures. 

Detectable (diagnostic) and therapeutic agents that may be attached to a polypeptide 
include, but are not limited to, toxins, other cytotoxic agents, drugs, radionuclides, 
chromophores, enzymes that catalyze a colorimetric or fluorometric reaction, and the like, 
with the particular agent being chosen according to the intended application. Among the 
toxins are ricin, abrin, diphtheria toxin, Pseudomonas aeruginosa exotoxin A, ribosomal 
inactivating proteins, mycotoxins such as trichothecenes, and derivatives and fragments 
(e.g., single chains) thereof. Radionuclides suitable for diagnostic use include, but are not 
limited to, l23 I, l3l I, ""Tc, 1 1 'in, and 76 Br. Examples of radionuclides suitable for 
therapeutic use are m I, 2Il At, 77 Br, lg6 Re, ,88 Re, 2I2 Pb, 2l2 Bi, 109 Pd, "Cu, and 67 Cu. 

Such agents may be attached to the polypeptide by any suitable conventional 

procedure. The polypeptide comprises functional groups on amino acid side chains that can 

be reacted with functional groups on a desired agent to form covalent bonds, for example. 
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Alternatively, the protelWr agent may be derivatized to generat^rattach a desired reactive 
functional group. The derivatization may involve attachment of one of the bi functional 
coupling reagents available for attaching various molecules to proteins (Pierce Chemical 
Company. Rockford, Illinois). A number of techniques for radiolabeling proteins are 
known. Radionuclide metals may be attached to polypeptides by using a suitable 
bi functional chelating agent, for example. 

Conjugates comprising polypeptides and a suitable diagnostic or therapeutic agent 
(preferably covalently linked) are thus prepared. The conjugates are administered or 
otherwise employed in an amount appropriate for the particular application. 



Therapeutic Agents 

Polypeptides of the invention may be used in developing treatments for any disorder 
mediated (directly or indirectly) by defective, or insufficient amounts of the polypeptides. 
These polypeptides may be administered to a mammal afflicted with such a disorder. 
15 The polypeptides may also be employed in inhibiting a biological activity of 

(binding partner), in in vitro or in vivo procedures. For example, a purified polypeptide may 
be used to inhibit binding of (binding partner) to endogenous cell surface (binding partner). 
Biological effects that result from the binding of (binding partner) to endogenous receptors 
thus are inhibited. 

20 IMX polypeptides may be administered to a mammal to treat a (binding partner- 

mediated disorder. Such (binding partner)-mediated disorders include conditions caused 
(directly or indirectly) or exacerbated by (binding partner). 

Compositions of the present invention may contain a polypeptide in any form 
described herein, such as native proteins, variants, derivatives, oligomers, and biologically 

25 active fragments. In particular embodiments, the composition comprises a soluble 
polypeptide or an oligomer comprising soluble polypeptides. 

Compositions comprising an effective amount of a polypeptide of the present 
invention, in combination with other components such as a physiologically acceptable 
diluent, carrier, or excipient, are provided herein. The polypeptides can be formulated 

30 according to known methods used to prepare pharmaceutical^ useful compositions. They 

can be combined in admixture, either as the sole active material or with other known active 

materials suitable for a given indication, with pharmaceutically acceptable diluents (e.g., 

saline, Tris-HCl, acetate, and phosphate buffered solutions), preservatives (e.g., thimerosal, 
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benzyl alcohol, paraben?r emulsifiers, solubilizers, adjuvants amrar earners. Suitable 
formulations for pharmaceutical compositions include those described in Remington's 
Pharmaceutical Sciences. 1 6th ed. 1980, Mack Publishing Company, Easton. PA. 

In addition, such compositions can be complexed with polyethylene glycol (PEG), 
metal ions, or incorporated into polymeric compounds such as polyacetic acid, polyglycolic 
acid, hydrogels, dextran, etc., or incorporated into liposomes, microemulsions, micelles, 
unilamellar or multilamellar vesicles, erythrocyte ghosts or spheroblasts. Such 
compositions will influence the physical state, solubility, stability, rate of in vivo release, 
and rate of in vivo clearance, and are thus chosen according to the intended application. 

The compositions of the invention can be administered in any suitable manner, e.g., 
topically, parenterally, or by inhalation. The term "parenteral" includes injection, e.g., by 
subcutaneous, intravenous, or intramuscular routes, also including localized administration, 
e.g., at a site of disease or injury. Sustained release from implants is also contemplated. 
One skilled in the pertinent art will recognize that suitable dosages will vary, depending 
upon such factors as the nature of the disorder to be treated, the patient's body weight, age, 
and general condition, and the route of administration. Preliminary doses can be determined 
according to animal tests, and the scaling of dosages for human administration is performed 
according to art-accepted practices. 

Compositions comprising nucleic acids in physiologically acceptable formulations 
are also contemplated. DNA may be formulated for injection, for example. 



Research Agents 

Another use of the polypeptide of the present invention is as a research tool for 
studying the biological effects that result from inhibiting interactions between polypeptides 
of the invention and their "binding partner". Polypeptides also may be employed in in vitro 
assays for detecting binding partner or cells expressing the binding partner or the 
interactions thereof. 
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The polypeptides of the present invention can be subjected to fragmentation into 
smaller peptides by chemical and enzymatic means, and the peptide fragments so produced 
can be used in the analysis of other proteins or polypeptides. For example, such peptide 
fragments can be used as peptide molecular weight markers, peptide isoelectric point 
markers, or in the analysis of the degree of peptide fragmentation. Thus, the invention also 
includes these polypeptides and peptide fragments, as well as kits to aid in the determination 
of the apparent molecular weight and isoelectric point of an unknown protein and kits to 
assess the degree of fragmentation of an unknown protein. 

Although all methods of fragmentation are encompassed by the invention, chemical 
fragmentation is a preferred embodiment, and includes the use of cyanogen bromide to 
cleave under neutral or acidic conditions such that specific cleavage occurs at methionine 
residues (E. Gross, Methods in Enz. 1 1:238-255, 1967). This can further include additional 
steps, such as a carboxymethylation step to convert cysteine residues to an unreactive 
species. 

Enzymatic fragmentation is another preferred embodiment, and includes the use of a 

protease such as Asparaginylendo-peptidase, Arginylendo-peptidase, Achromobacter 

protease I, Trypsin, Staphlococcus aureus V8 protease, Endoproteinase Asp-N, or 

Endoproteinase Lys-C under conventional conditions to result in cleavage at specific amino 

acid residues. Asparaginylendo-peptidase can cleave specifically on the carboxyi side of the 

asparagine residues present within the polypeptides of the invention. Arginylendo-peptidase 

can cleave specifically on the carboxyi side of the arginine residues present within these 

polypeptides. Achromobacter protease I can cleave specifically on the carboxyi side of the 

lysine residues present within the polypep tides (Sakiyama and Nakat, U.S. Patent 

No. 5,248,599; T. Masaki et al., Biochim. Biophys. Acta 660:44-50, 1981; T. Masaki et al., 

Biochim. Biophys. Acta 660:51-55, 1981). Trypsin can cleave specifically on the carboxyi 

side of the arginine and lysine residues present within polypeptides of the invention. 

Enzymatic fragmentation may also occur with a protease that cleaves at multiple amino acid 

residues. For example, Staphlococcus aureus V8 protease can cleave specifically on the 

carboxyi side of the aspartic and glutamic acid residues present within polypeptides (D. W. 

Cleveland, J. Biol. Chem. 3:1 102-1 106, 1977). Endoproteinase Asp-N can cleave 

specifically on the amino side of the asparagine residues present within polypeptides. 

Endoproteinase Lys-C can cleave specifically on the carboxyi side of the lysine residues 
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present within polypeptides of the invention. Other enzymatic ancrchemical treatments can 
likewise be used to specifically fragment these polypeptides into a unique set of specific 
peptides. 

Of course, the peptides and fragments of the polypeptides of the invention can also 
5 be produced by conventional recombinant processes and synthetic processes well known in 
the art. With regard to recombinant processes, the polypeptides and peptide fragments 
encompassed by invention can have variable molecular weights, depending upon the host 
cell in which they are expressed. Glycosylation of polypeptides and peptide fragments of 
the invention in various cell types can result in variations of the molecular weight of these 

10 pieces, depending upon the extent of modification. The size of these pieces can be most 
heterogeneous with fragments of polypeptide derived from the extracellular portion of the 
polypeptide. Consistent polypeptides and peptide fragments can be obtained by using 
polypeptides derived entirely from the transmembrane and cytoplasmic regions, pretreating 
with N-glycanase to remove glycosylation, or expressing the polypeptides in bacterial hosts. 

15 The molecular weight of these polypeptides can also be varied by fusing additional 

peptide sequences to both the amino and carboxyl terminal ends of polypeptides of the 
invention. Fusions of additional peptide sequences at the amino and carboxyl terminal ends 
of polypeptides of the invention can be used to enhance expression of these polypeptides or 
aid in the purification of the protein. In addition, fusions of additional peptide sequences at 

20 the amino and carboxyl terminal ends of polypeptides of the invention will alter some, but 
usually not all, of the fragmented peptides of the polypeptides generated by enzymatic or 
chemical treatment. Of course, mutations can be introduced into polypeptides of the 
invention using routine and known techniques of molecular biology. For example, a 
mutation can be designed so as to eliminate a site of proteolytic cleavage by a specific 

25 enzyme or a site of cleavage by a specific chemically induced fragmentation procedure. The 
elimination of the site will alter the peptide fingerprint of polypeptides of the invention upon 
fragmentation with the specific enzyme or chemical procedure. 

The polypeptides and the resultant fragmented peptides can be analyzed by methods 
including sedimentation, electrophoresis, chromatography, and mass spectrometry to 

30 determine their molecular weights. Because the unique amino acid sequence of each piece 

specifies a molecular weight, these pieces can thereafter serve as molecular weight markers 

using such analysis techniques to assist in the determination of the molecular weight of an 

unknown protein, polypeptides or fragments thereof. The molecular weight markers of the 
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tr.\ ention serve particuHWy well as molecular weight markers fd^ne estimation of the 
apparent molecular weight of proteins that have similar apparent molecular weights and, 
consequently, allow increased accuracy in the determination of apparent molecular weight 
ofrroteins. 

5 When the invention relates to the use of fragmented peptide molecular weight 

markers, those markers are preferably at least 10 amino acids in size. More preferably, these 
fragmented peptide molecular weight markers are between 10 and 100 amino acids in size. 
Even more preferable are fragmented peptide molecular weight markers between 10 and 50 
amino acids in size and especially between 10 and 35 amino acids in size. Most preferable 

10 are fragmented peptide molecular weight markers between 10 and 20 amino acids in size. 

Among the methods for determining molecular weight are sedimentation, gel 
electrophoresis, chromatography, and mass spectrometry. A particularly preferred 
embodiment is denaturing polyacrylamide gel electrophoresis (U. K. Laemmli, Nature 
22^:680-685, 1970). Conventionally, the method uses two separate lanes of a gel containing 

1 5 sodium dodecyl sulfate and a concentration of acrylamide between 6-20%. The ability to 
simultaneously resolve the marker and the sample under identical conditions allows for 
increased accuracy. It is understood, of course, that many different techniques can be used 
for the determination of the molecular weight of an unknown protein using polypeptides of 
the invention, and that this embodiment in no way limits the scope of the invention. 

20 Each unglycosylated polypeptide or fragment thereof has a pi that is intrinsically 

determined by its unique amino acid sequence (which pi can be estimated by the skilled 
artisan using any of the computer programs designed to predict pi values currently available, 
calculated using any well-known amino acid pKa table, or measured empirically). 
Therefore these polypeptides and fragments thereof can serve as specific markers to assist in 

25 the determination of the isoelectric point of an unknown protein, polypeptide, or fragmented 
peptide using techniques such as isoelectric focusing. These polypeptide or fragmented 
peptide markers serve particularly well for the estimation of apparent isoelectric points of 
unknown proteins that have apparent isoelectric points close to that of the polypeptide or 
fragmented peptide markers of the invention. 

30 The technique of isoelectric focusing can be further combined with other techniques 

such as gel electrophoresis to simultaneously separate a protein on the basis of molecular 

weight and charge. The ability to simultaneously resolve these polypeptide or fragmented 

peptide markers and the unknown protein under identical conditions allows for increased 
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accuracy in the determifTSnon of the apparent isoelectric point of ofe unknown protein. This 
is of particular interest in techniques, such as two dimensional electrophoresis (T.D. Brock 
and M.T. Madigan, Biology of Microorganisms 76-77 (Prentice Hall, 6d ed. 1991)), where 
the nature of the procedure dictates that any markers should be resolved simultaneously with 
5 the unknown protein. In addition, with such methods, these polypeptides and fragmented 
peptides thereof can assist in the determination of both the isoelectric point and molecular 
weight of an unknown protein or fragmented peptide. 

Polypeptides and fragmented peptides can be visualized using two different methods 
that allow a discrimination between the unknown protein and the molecular weight markers. 

10 In one embodiment, the polypeptide and fragmented peptide molecular weight markers of 
the invention can be visualized using antibodies generated against these markers and 
conventional immunoblotting techniques. This detection is performed under conventional 
conditions that do not result in the detection of the unknown protein. It is understood that it 
may not be possible to generate antibodies against all polypeptide fragments of the 

1 5 invention, since small peptides may not contain immunogenic epitopes. It is further 

understood that not all antibodies will work in this assay; however, those antibodies which 
are able to bind polypeptides and fragments of the invention can be readily determined using 
conventional techniques. 

The unknown protein is also visualized by using a conventional staining procedure. 

20 The molar excess of unknown protein to polypeptide or fragmented peptide molecular 
weight markers of the invention is such that the conventional staining procedure 
predominantly detects the unknown protein. The level of these polypeptide or fragmented 
peptide molecular weight markers is such as to allow little or no detection of these markers 
by the conventional staining method. The preferred molar excess of unknown protein to 

25 polypeptide molecular weight markers of the invention is between 2 and 100,000 fold. 
More preferably, the preferred molar excess of unknown protein to these polypeptide 
molecular weight markers is between 10 and 10,000 fold and especially between 100 and 
1 ,000 fold. 

It is understood of course that many techniques can be used for the determination 
30 and detection of molecular weight and isoelectric point of an unknown protein, 

polypeptides, and fragmented peptides thereof using these polypeptide molecular weight 
markers and peptide fragments thereof and that these embodiments in no way limit the 
scope of the invention. 
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In another embodiment, the analysis of the progressive fragmentation of the 
polypeptides of the invention into specific peptides (D. W. Cleveland et al.. J. Biol. Chem. 
252: 1 102-1 1 06, 1977), such as by altering the time or temperature of the fragmentation 
reaction, can be used as a control for the extent of cleavage of an unknown protein. For 
example, cleavage of the same amount of polypeptide and unknown protein under identical 
conditions can allow for a direct comparison of the extent of fragmentation. Conditions that 
result in the complete fragmentation of the polypeptide can also result in complete 
fragmentation of the unknown protein. 

As to the specific use of the polypeptides and fragmented peptides of the invention 
as molecular weight markers, the fragmentation of the polypeptides of SEQ ID NOs:24-33 
with cyanogen bromide generates a unique set of fragmented peptide molecular weight 
markers. An additional fragment results if the initiating methionine is present. The 
distribution of methionine residues determines the number of amino acids in each peptide 
and the unique amino acid composition of each peptide determines its molecular weight. 

In addition, the preferred purified polypeptides of the invention (SEQ ED NOs:24- 
33) have calculated molecular weights of approximately 3683, 1783, 1 1248, 75503, 43040, 
8051, 33306. 3515, 10736, 25162, and 2450 Daltons. 

Where an intact protein is used, the use of these polypeptide molecular weight 
markers allows increased accuracy in the determination of apparent molecular weight of 
proteins that have apparent molecular weights close to 3683, 1783, 1 1248, 75503, 43040, 
8051, 33306. 3515, 10736, 25162, or 2450 Daltons. Where fragments are used, there is 
increased accuracy in determining molecular weight over the range of the molecular weights 
of the fragment. 

Finally, as to the kits that are encompassed by the invention, the constituents of such 
kits can be varied, but typically contain the polypeptide and fragmented peptide molecular 
weight markers. Also, such kits can contain the polypeptides wherein a site necessary for 
fragmentation has been removed. Furthermore, the kits can contain reagents for the specific 
cleavage of the polypeptide and the unknown protein by chemical or enzymatic cleavage. 
Kits can further contain antibodies directed against polypeptides or fragments thereof of the 
invention. 

Identification of Unknown Proteins 
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As set forth above, a polypeptide or peptide fingerprint can be entered into or 
compared to a database of known proteins to assist in the identification of the unknown 
protein using mass spectrometry (W.J. Henzel et al.. Proc. SatL Acad. Sci. USA 90:501 1- 
5015. 1993; D. Fenyo et al., Electrophoresis 19:995-1005, 1998). A variety of computer 
5 software programs to facilitate these comparisons are accessible via the Internet, such as 
Protein Prospector (Internet site: prospector.uscf.edu), Multildent (Internet site: 
www.expasy.ch sprot/multiident.html), PeptideSearch (Internet site: www.mann.embl- 
heiedelberg.de. ..deSearch/FR_PeptideSearch Form.html), and ProFound (Internet site: 
www.chait-sgi.rockefeller.edu/cgi-bin/prot-id-frag.html). These programs allow the user to 

10 specify the cleavage agent and the molecular weights of the fragmented peptides within a 
designated tolerance. The programs compare these molecular weights to protein databases 
to assist in determining the identity of the unknown protein. 

In addition, a polypeptide or peptide digest can be sequenced using tandem mass 
spectrometry (MS/MS) ^ the resulting sequence searched against databases (J.K. Eng, et 

15 al., 7. Am. Soc. Mass Spec. 5:976-989 (1994); M. Mann and M. Wilm, Anal. Chem. 

66:4390-4399 (1994); J.A. Taylor and R.S. Johnson, Rapid Comm. Mass Spec. 1 1:1067- 
1075 ( 1 997)). Searching programs that can be used in this process exist on the Internet, 
such as Lutefisk 97 (Internet site: www.isbc.com: 70 Lutefisk97.html), and the Protein 
Prospector, Peptide Search and ProFound programs described above. Therefore, adding the 

20 sequence of a gene and its predicted protein sequence and peptide fragments to a sequence 
database can aid in the identification of unknown proteins using tandem mass spectrometry. 

Antibodies 

Antibodies that are immunoreactive with the polypeptides of the invention are 
provided herein. Such antibodies specifically bind to the polypeptides via the antigen- 
binding sites of the antibody (as opposed to non-specific binding). Thus, the polypeptides, 
fragments, variants, fusion proteins, etc., as set forth above may be employed as 
immunogens in producing antibodies immunoreactive therewith. 

Polyclonal and monoclonal antibodies may be prepared by conventional techniques. 
See, for example, Monoclonal Antibodies, Hybridomas: A New Dimension in Biological 
Analyses , Kennet et al. (eds.), Plenum Press, New York (1980); and Antibodies: A 
Laboratory Manual \ Harlow and Land (eds.), Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, (1988). 
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Antigen-binding fragments of such antibodies, which maybe produced by 
conventional techniques, are also encompassed by the present invention. Examples of such 
fragments include, but are not limited to, Fab and F(ab'): fragments. Antibody fragments 
and derivatives' produced by genetic engineering techniques are also provided. 
5 The monoclonal antibodies of the present invention include chimeric antibodies, 

e.g., humanized versions of murine monoclonal antibodies. Such humanized antibodies 
may be prepared by known techniques, and offer the advantage of reduced immunogenic ity 
when the antibodies are administered to humans. In one embodiment, a humanized 
monoclonal antibody comprises the variable region of a murine antibody (or just the antigen 

10 binding site thereof) and a constant region derived from a human antibody. Alternatively, a 
humanized antibody fragment may comprise the antigen binding site of a murine 
monoclonal antibody and a variable region fragment (lacking the antigen-binding site) 
derived from a human antibody. Procedures for the production of chimeric and further 
engineered monoclonal antibodies include those described in Riechmann et al. {Nature 

15 3J2:323, 1988), Liu et al. (PNAS 84:3439, 1987), Larrick et al. (Bio/Technology 7:934, 

1989), and Winter and Harris (TIPS 74:139, May, 1993). Procedures to generate antibodies 
transgenically can be found in GB 2,272,440, US Patent Nos. 5,569,825 and 5,545,806 and 
related patents claiming priority therefrom, all of which are incorporated by reference 
herein. 

20 In one embodiment, the antibodies are specific for the polypeptides of the present 

invention, and do not cross-react with other proteins. Screening procedures by which such 
antibodies may be identified are well known, and may involve immunoaffinity 
chromatography, for example. 

Hyforidoma cell lines that produce monoclonal antibodies specific for the 

25 polypeptides of the invention are also contemplated herein. Such hybridomas may be 
produced and identified by conventional techniques. One method for producing such a 
hybridoma cell line comprises immunizing an animal with a polypeptide; harvesting spleen 
cells from the immunized animal; fusing said spleen cells to a myeloma cell line, thereby 
generating hybridoma cells; and identifying a hybridoma cell line that produces a 

30 monoclonal antibody that binds the polypeptide. The monoclonal antibodies may be 
recovered by conventional techniques. 



Uses Thereof 
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^^^e invention can be used in assavs to <^P^t 



The antibodies dffne invention can be used in assays to deffct the presence of the 
polypeptides or fragments of the invention, either in vitro or in vivo. The antibodies also 
may be employed in purifying polypeptides or fragments of the invention by immunoaffinity 
chromatography. 

5 Those antibodies that additionally can block binding of the polypeptides of the 

invention to the binding partner may be used to inhibit a biological activity that results from 
such binding. Such blocking antibodies may be identified using any suitable assay 
procedure, such as by testing antibodies for the ability to inhibit binding of the binding 
partner to certain cells expressing the binding partner. Alternatively, blocking antibodies 

10 may be identified in assays for the ability to inhibit a biological effect that results from 
binding of the binding partner to target cells. 

Such an antibody may be employed in an in vitro procedure, or administered in vivo 
to inhibit a biological activity mediated by the entity that generated the antibody. Disorders 
caused or exacerbated (directly or indirectly) by the interaction of (binding partner) with cell ? 

1 5 surface (binding partner) receptor thus may be treated. A therapeutic method involves in 

vivo administration of a blocking antibody to a mammal in an amount effective in inhibiting 
a (binding partner)- mediated biological activity. Monoclonal antibodies are generally 
preferred for use in such therapeutic methods. In one embodiment, an antigen-binding 
antibody fragment is employed. 

20 Antibodies may be screened for agonistic (i.e., ligand-mimicking) properties. Such 

antibodies, upon binding to cell surface antigen, induce biological effects (e.g., transduction 
of biological signals) similar to the biological effects induced when (binding partner) binds 
to cell surface antigen. 

Compositions comprising an antibody that is directed against a polypeptide of the 

25 invention, and a physiologically acceptable diluent, excipient, or carrier, are provided 
herein. 

Also provided herein are conjugates comprising a detectable (e.g., diagnostic) or 
therapeutic agent, attached to the antibody. Examples of such agents are presented above. 
The conj ugates find use in in vitro or in vivo procedures. 
30 The following examples are provided to further illustrate particular embodiments of 

the invention, and are not to be construed as limiting the scope of the present invention. 



EXAMPLE 1: Isolation of the IMX Nucleic Acids 
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The T84 Epithelial BanTgrModel 

As discussed above, damage to the intestinal epithelial barrier is a hallmark of 
(EBD), and a number of in vitro models of epithelial barrier function have been developed 
over the years. The best characterized of these models is the TS4 intestinal epithelial barrier 
5 system, Dharmsathaphorn et al.. Am. J. Physiol 246:G204-G208. 1984 and Madara et al., J. 
Cell BioL, 101:2124-2133.1985). 

T84 cells were plated on 75 mm polycarbonate transwell filter inserts (Costar) and 
grown in DME/F12 (1:1 ) containing 10% heat- inactivated bovine calf serum. The cells 
were maintained at confluence for 2-3 days, and integrity of the epithelial barrier was 

10 determined by measuring transepithelial electrical resistance (TER) using an EVOM 

epithelial voltohmmeter (World Precision Instruments). When the TER values were greater 
than 1000 ohms/cm2 and were stable, cells were treated with interferon-g (30 ng/ml, 
Genzyme) added to the basolateral side of the membrane. At various times after treatment 
(4, 24 and 44h), TERs were measured to monitor the interferon- induced disruption of the 

15 barrier, and RNA was harvested from the cells at those time points using TRIzol reagent 
(Life Technologies). RNA was extracted using conventional methods and subsequently 
used for TOGA™ analysis as described in Example 2. 
TOGA™ analysis 

This example describes a method for determining mRNA expression characteristics. 

20 The isolated RNA was analyzed using a method of simultaneous sequence-specific 

identification of mRNAs known as TOGA™ (Total Gene expression Analysis) described in 
U.S. Patent No. 5,459,037 and U.S. Patent No. 5,807,680, hereby incorporated herein by 
reference. Preferably, prior to the application of the TOGA™ technique, the isolated RNA 
was enriched to form a starting polyA-containing mRNA population by methods known in 

25 the art. In a preferred embodiment, the TOGA™ method further comprised an additional 
PCR step performed using four separate reactions, one for each of the four 5' PCR primers, 
and cDNA templates prepared from a population of antisense cRNAs. A final PCR step 
used 256 5' PCR primers in 64 subpools for each of the four reactions of the previous step 
produced PCR products that were cDNA fragments that corresponded to the 3*-region of the 

30 starting mRNA population. 

The produced PCR products were then identified by a) the sequence of at least the 5* 

seven base pairs, preferably the sequence of the entire fragment, and b) the length of the 

fragment. These two parameters, sequence and fragment length, were used to compare the 
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obtained PCR products ^^database of known polynucleotide se^fcces. A database 
search for homologous sequences in Genbank resulted in no matches, indicating the noveltv 
of the [MX sequences of the invention. The intensities of the PCR products were compared 
across the 4 input RNA samples (t=0, 4hr, 24hr, 44hr) and species that were regulated were 
identified and further characterized (Table 1). 



The DSTs of SEQ ID NO: 1-23 and fragments thereof are useful as probes to study 
and diagnose the changes in gene expression demonstrated by the data of Table 1 . 
Alternatively, polypeptides and fragments thereof that are the translations of SEQ ED NO:l- 
23 and fragments thereof are useful as probes to study and diagnose the changes in gene 
expression demonstrated by the data of Table 1. 
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Relative PCR Fragment Amount 


Name 


DST SEQ 
ID NO: 


Digital 
Address (Msp I) 


0 


4hr 


24hr 


44hr 


IMX 4 


I 


TAAG IS 


41 


94 


350 


299 


IMX 10 


14 


TTCT 1^9 


250 


716 


250 


207 


IMX 21 


• * 

.Is 


CTTC 364 


35 


60 


330 


749 


IMX 28 


16 


CCAT414 


138 


435 


647 


464 


IMX 32 


17 


CCGA 374 


237 


539 


25 


32 


IMX 39 


19 


TGGA 450 


24 


87 


no 


78 


EMX 40 


7 


TCTA 163 


38 


294 


67 


153 


IMX 42 


8 


GAGC 426 


145 


733 


876 


1076 


IMX 44 


21 


CTGC 22 1 


248 


344 


1499 


1624 


IMX 56 


10 


GATA 1 1 1 


955 


1223 


2357 


3450 



Table 1. IMX Nucleic Acid Molecules. Relative expression levels determined by TOGA™. 

5 EXAMPLE 2: Further Characterization of IMX4 

Nucleic Acid Molecules and Polypeptides 

An additional 569bp sequence (SEQ ID NO: 1 1) that included DST IMX4 (SEQ ED 
NO:l) was obtained by anchor PCR from a T84 library. A clone approximately 2.0kb in 
length was then isolated from a commercial library (Origene). The sequence of the 3' end of 

10 the clone (SEQ ID NO: 12) included the sequence of the IMX4 DST. The sequence of the 
5' end of the clone (SEQ ID NO: 13) matched part of the human ApoL gene (AJF019225, 
Figure 21). Complete sequence of the clone was not obtained. However, comparison of 5* 
end of the clone suggests it represents an alternative splice product to the reported ApoL 
sequence, i.e., bases -i- 168 match to 2 exons on PAC carrying the ApoL gene but are not 

15 included in the reported complete cDNA. See Figure 22. A polypeptide translated from 
IMX4 is provided in SEQ ID NO: 24. 

EXAMPLE 3: Further Characterization of IMX 10 and IMX21 

Nucleic Acid Molecules and Polypeptides 

20 
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Further experimet^Tfailed to show regulation of these DS 





replications using 



PCR with extended primers and Northern blots. A polypeptide translated from IMX10 (SEQ 
ID NO: 14) is provided in SEQ ED NO: 25. A polypeptide translated from EMX21 (SEQ [D 
NO: 1 5) is provided in SEQ ID NO: 26. 



ligands for this enzyme. The GST fusion of IMX28 was prepared as follows: IMX28-8, a 
clone with the full length coding sequence, was excised from pGEM by Sall-NotI digestion 
and subcloned into Sall-NotI digested pGEX-5X-3 (Pharmacia). The pGEX-5X-3 plasmid 
contains the Schistosoma iaponicum glutathione-S-transferase (GST) gene 5' to the Sall- 

15 NotI cloning site. Insertion of the IMX28-8 DNA fragment into this cloning site results in a 
fusion gene containing GST-IMX28 in the proper reading frame. The complete sequence of 
the fusion protein consists of GST followed by a Factor Xa tripeptide cleavage site, the 
peptide consisting of Gly-De-Pro-Arg-Asn-Ser-Arg-Val-Asp-Ala-Thr that is derived 
partially from the pGEX vector and partially from the Sall-NotI fragment of IMX28 and 

20 1MX28. The vector containing the fusion gene was electroporated into competent bacterial 
cells and a single colony containing the DNA, confirmed by diagnostic restriction digestion, 
was expanded to a 250ml culture for purification of the fusion protein. 

Briefly, cells from these cultures were pelleted and resupended in STE buffer 
containing a protease inhibitor cocktail and 20ug/ml hen egg lysozyme. The cells were lysed 

25 by addition of 10% sarcosyl in STE (final sarcosyl concentration was 1.5%), sonicated and 
centrifiiged to remove cell debris. The supernatant was passed over a Glutathione-Sepharose 
4B column, washed sequentially with 1.5% sarcosyl in STE and then STE. The bound 
material, containing the GST-EMX28 fusion protein was eluted by adding 20mM 
Glutathione in STE. The eluted material was analyzed by SDS-PAGE to confirm the 

30 presence of the GST-IMX28 fusion protein. 

A recombinant adenovirus construct expressing EMX28 was used to infect T84 cells 
to determine effects on epithelial barrier formation. To construct adenoviral vectors with 
untagged full length Imx28 and NH2-terminally FLAG tagged Imx28, the p AD EASY! 



EXAMPLE 4: Further Characterization of IMX28 
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Nucleic Acid Molecules and Polypeptides 

One polypeptide translated from IMX28 is provided in SEQ ED NO: 27. 

A GST fusion protein of IMX28 was used in studies designed to identify potential 





system described by H^ffaL (199S ) Proc. Natl. Acad. Sci. US^ 95: 2509-2514, was 
used. For the untagged construct a Notl fragment carrying the full coding region of lmx28 
was inserted into the polylinker of the pAdtrackCMV shuttle vector. After confirmation of 
the correct orientation the vector was cut with the restriction enzyme Pacl and co- 
transformed into E. coli . Strain BJ51S3 along with the pADEasyl vector. Recombinants 
containing the lmx28 product were screened for as described and by gene specific PCR. To 
generate non-replicative, infectious viral particles the helper strain 293EBNA-EBNA or 
293MSR was transfected with Pme l linearized recombinant vector. After 3 davs virus was 
harvested by freeze-thaw lysis of the transfected cells. Several rounds of infection of these 
cells with viral supernatents were performed to boost titers. The tagged version was 
prepared by addition of the FLAG sequence and a Gly-Gly-Gly-Gly spacer region onto the 
full length coding sequence of Imx28 via PCR with a 5' oligo with the incorporated 
nucleotides for the tag and spacer. The PCR product was cloned into a TA- vector and 
excised with Not l and cloned as above. Screening and viral particle production was as 
above. No deleterious or positive effects were observed on T84 barrier function in the 
presence or absence of IFN-gamma. 



The original clone produced by extending the IMX32 DST was 1588 bp in length 
(SEQ ID NO: 17). The first round of anchor PCR yielded a 670bp product that was used to 
produce a contig that extended the sequence 585 bases to 2173 (SEQ ID NO: 18). Another 
round of anchor PCR yielded the 1676 bp product, immunex32-a24.seq. which further 
extended the contig for Imx32 to 2834 bp. In addition, the immunex32-a24.seq product 
revealed an alternative splice lacking bases 21 1-639 of EMX32-1 clone. 

There is an ORF for 155 amino acids at the 5' end of the sequence (SEQ ID NO: 28 
translated from SEQTD NO: 17). BLAST analysis suggests that the ORF may contain a 
KRAB-like domain because it has partial matches to Zn-finger proteins which contain 
KRAB domains at their N-terminus such as ZNF140. 



EXAMPLE 5: Further Characterization of IMX32 



Nucleic Acid Molecules and Polypeptides 



EXAMPLE 6: Further Characterization of IMX39 



Nucleic Acid Molecules and Polypeptides 

One polypeptide translated from IMX39 is provided in SEQ ED NO: 29. 
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EMX39 FLAG-tagged and untagged adenoviral vector versions were prepared as 
outlined above for Imx28 in Example 4 % above. Controls for the expression were infection 
with other FLAG- tagged adenoviral delivered proteins such as EMX5. Based upon the 
predicted structure from its cDNA sequence, it was expected that the IMX39 polypeptide 
would be a cytoplasmic protein. However, since some secreted cytokines, such as those of 
the IL-1 family, lack a predicted signal sequence but are secreted from the cell, this 
expectation was tested experimentally in both T84 cells via adenoviral mediated 
transduction and via transfection into CV1EBNA cells with another expression vector, 
pDC412. Despite evidence for succesful expression in the transfected cells, no product was 
found in the media as determined by FLAG western (T84 system) or 35-S radiolabeled 
product (CV1 EBNA system). The positive control in the CV1 system was an Imx44-Fc 
fusion protein known to be secrete. In both systems empty vectors were used as a negative 
control. The attempted expression of EMX39 polypeptides had no apparent effect on barrier 
function in the absence or presence of EFN-gamma. 

EXAMPLE 7: Further Characterization of IMX40 
Nucleic Acid Molecules and Polypeptides 

Anchor PCR using an Origene human small intestine library revealed a product of 
1669 bp (SEQ ID NO: 20). Bases 1-235 of the DST IMX40 (SEQ ID NO:7) align with 
bases 1565-1669 of this sequence. The anchor PCR product has a potential 155 aa ORF 
from bases 185-650. A translation of SEQ ID NO: 7 is given in SEQ ID NO:30. This ORF 
has no recognizable features, motifs, or database homologies. 

EXAMPLE 8: Further Characterization of IMX42 
Nucleic Acid Molecules and Polypeptides 

Several attempts to screen libraries using DST IMX42 (SEQ ID NO:8) as well as 
anchor PCR and RACE have not produced a clone with a longer sequence. BLAST 
comparison against imxhutdb extended sequence approx 200 bases to 592 bases. However, 
the sequence extension did not increase any database matches found by comparison to the 
DST IMX42 sequence. The contig has a potential ORF of 134 amino acids which, while a 
COOH terminal extension of SEQ ID NO:31, may be a partial sequence. 
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EXAMPLE 9: Further Characterization of I-V1X44 





Nucleic Acid Molecules and Polypeptides 



Longer polynucleotides corresponding to DST IMX44 have been identified (SEQ ED 
NO:21,22). A translated polypeptide is found in SEQ ID NO:32. A solubte Fc form of 
CMX44 polypeptide was synthesized and used in various assays. IMX44-Fc had no effect on 
TS4 barrier function in the absence or presence of EFN-gamma. IMX44-Fc had no effect on 
natural killer (NK) cell activation. IMX44-Fc had no positive hits on cognate screen assays. 

A soluble FLAG polyHis form was also produced. No activity in cellular activation 
assays was found nor any alteration of cytokine production using this polypeptide in assays. 

The expression of IMX44 in various murine models of gut inflammation was 
determined by Northern and array analysis. Little to no regulation of transcript was found 
was found in anti-CD3-induced ileitis in C57BL/6 mice, DSS-induced colitis in BALB/c 
mice or C57BL/6 mice, mdr\ knock out mice with colitis, and LFN-gamma stimulated LN T 
cells. 



A translated IMX56 polypeptide is found in SEQ ID NO:33. Comparison of the 
DST sequence IMX56 to IMAGE consortium clones extended the sequence (SEQ ID 
NO:23) which was 3' on sequenced PAC to the described end of human ApoL. Anchor 
PCR using a T84 library produced results that indicated that the IMX56 DST is derived 
from an alternate 3' UTR of ApoL. 

EXAMPLE 11: Monoclonal Antibodies That Bind IMX Polypeptides 

Monoclonal antibodies that bind the polypeptides of the invention can be prepared 
by methods well known in the art. Suitable immunogens that may be employed in 
generating such antibodies include, but are not limited to, purified IMX polypeptides or an 
immunogenic fragment thereof such as the extracellular domain, or fusion proteins 
containing IMX polypeptides (e.g., a soluble IMX 21polypeptide/Fc fusion protein). 

Purified IMX polypeptides of the invention can be used to generate monoclonal 
antibodies immunoreactive therewith, using conventional techniques such as those 
described in U.S. Patent 4,41 1,993. Briefly, mice are immunized with an IMX polypeptide 
immunogen emulsified in complete Freund's adjuvant, and injected in amounts ranging from 



EXAMPLE 10: Further Characterization of IMX56 



Nucleic Acid Molecules and Polypeptides 
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10-100 :g subcutaneously or intraperitoneally. Ten to twelve dayslater, the immunized 
animals 2.re boosted with additional EVIX polypeptide immunogen emulsified in incomplete 
Freund's adjuvant. Mice are periodically boosted thereafter on a weekly to bi-weekly 
immunization schedule. Serum samples are periodically taken by retro-orbital bleeding or 
5 tail-tip excision to test for anti-IMX polypeptide antibodies by dot blot assay, ELISA 

(Enzyme-Linked Immunosorbent Assay) or inhibition of IMX polypeptide/binding partner 
interactions. 

Following detection of an appropriate antibody titer, positive animals are provided 
one last intravenous injection of IMX polypeptide immunogen in saline. Three to four days 

10 later, the animals are sacrificed, spleen cells harvested, and spleen cells are fused to a 
murine myeloma cell line, e.g., NS 1 or preferably P3x63 Ag8.653 (ATCC CRL 1 580). 
Fusions generate hybridoma cells, which are plated in multiple microtiter plates in a HAT 
(hypoxanthine, aminopterin and thymidine) selective medium to inhibit proliferation of non- 
fused cells, myeloma hybrids, and spleen cell hybrids. 

1 5 The hybridoma cells are screened by ELISA for reactivity against the purified IMX 

polypeptide of interest by adaptations of the techniques disclosed in Engvall et al., 
Imrnunochem. 8:871, 1971 and in U.S. Patent 4,703,004. A preferred screening technique is 
the antibody capture technique described in Beckmann et al., (J. Immunol. 144:4212, 1990) 
Positive hybridoma cells can be injected intraperitoneally into syngeneic BALB/c mice to 

20 produce ascites containing high concentrations of anti-IMX polypeptide monoclonal 

antibodies. Alternatively, hybridoma cells can be grown in vitro in flasks or roller bottles by 
various techniques. Monoclonal antibodies produced in mouse ascites can be purified by 
ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, 
affinity chromatography based upon binding of antibody to Protein A or Protein G can also 

25 be used, as can affinity chromatography based upon binding to the IMX polypeptide of 
interest. 

Alternatively, IMX nucleic acid molecules or fragments thereof can be expressed to 
produce IMX polypeptides or fragments thereof, that can be used to make antibodies that are 
useful for identifying corresponding polypeptides in techniques such as western blotting, 
30 immunocytochemistry ,and ELISA assays using standard techniques such as those described 
in U.S. Patent No. 4,900,81 1, incorporated by reference herein. 

The references cited herein are incorporated by reference herein in their entirety. 
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We claim: 



1. An isolated nucleic acid molecule comprising a polynucleotide chosen from 
the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ED NO:4, SEQ 

5 ID NO:5. SEQ ED NO:6, SEQ ED NO:7, SEQ ED NO:8, SEQ ED NO:9, SEQ ED NO: 10. 
SEQ ED NO: 1 1. SEQ ED NO: 12, SEQ ED NO: 13. SEQ ED NO: 14, SEQ ED NO: 15. SEQ ED 
NO: 16, SEQ ED NO: 17, SEQ ED NO: 18, SEQ ED NO:20, SEQ ED NO:21, and SEQ ED 
NO:23. 

2. The isolated nucleic acid molecule of claim I comprising a polynucleotide 
1 0 shown in SEQ ED NO: 1 . 

3. An isolated polypeptide chosen from the group consisting of SEQ ED NO:24, 
SEQ ED NO:25, SEQ ED NO:26, SEQ ED NO:27, SEQ ED NO:28, SEQ ID NO:29, SEQ ED 
NO:30, SEQ ED NO:3 i, SEQ ED NO:32, and SEQ ED NO:33. 

4. The isolated polypeptide of claim 3 comprising a polypeptide shown in SEQ 1 
15 EDNO:24. 

5. An isolated nucleic acid molecule comprising a polynucleotide at least 95% 
identical to the isolated nucleic acid molecule of claim 1. 

6. An isolated nucleic acid molecule at least ten bases in length that is 
hybridizable to the isolated nucleic acid molecule of claim 1 under stringent conditions. 

20 7. An isolated nucleic acid molecule encoding the polypeptide of claim 3. 

8. An isolated nucleic acid molecule encoding a fragment of the polypeptide of 
claim 3. 

9. An isolated nucleic acid molecule encoding a polypeptide epitope of the 
polypeptide of claim 3. 

25 10 The polypeptide of claim 3 wherein the polypeptide has biological activity. 

1 1 . An isolated nucleic acid encoding a species homologue of the polypeptide of 
claim 3. . 

12. The isolated nucleic acid molecule of claim 1, wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the 5* end or the 3*end. 

30 1 3. A recombinant vector comprising the isolated nucleic acid molecule of claim 

1. 

14. A recombinant host cell comprising the isolated nucleic acid molecule of 
claim 1. 
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15. A methoa of making the recombinant host cell oTTTaim 14 comprising 
transforming a host cell with the nucleic acid molecule of claim 1. 

16. The recombinant host cell of claim 14 comprising vector sequences. 

1 7. The isolated polypeptide of claim 3, wherein the isolated polypeptide 

5 comprises sequential amino acid deletions from either the C-terminus or the N-terminus. 

1 S. An isolated antibody that binds specifically to the isolated polypeptide of 

claim 3. 

19. The isolated antibody of claim 18 wherein the antibody is a monoclonal 
antibody. 

10 20. The isolated antibody of claim 1 8 wherein the antibody is a polyclonal 

antibody. 

21. A recombinant host cell that expresses the isolated polypeptide of claim 3. 

22. An isolated polypeptide produced by the steps of: 

(a) culturing the recombinant host cell of claim 14 under conditions such 
15 that said polypeptide is expressed; and 

(b) isolating the polypeptide. 

23. A method for preventing, treating, modulating, or ameliorating a medical 
condition, comprising administering to a mammalian subject a therapeutically effective 
amount of the polypeptide of claim 3 or the polynucleotide of claim 1. 

20 . 24. The method of claim 23 wherein the medical condition is inflammatory bow el 

disease. 

25. A method for preventing, treating, modulating, or ameliorating a medical 
condition comprising administering to a mammalian subject a therapeutically effective 
amount of the antibody of claim 1 8. 
25 26. The method of claim 25 wherein the medical condition is inflammatory bow el 

disease. 

27. A metfiod of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or absence of a mutation in the polynucleotide of 
30 claim 1; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or absence of said mutation. 
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28. The methuS^Tf claim 27 wherein the pathological c^mtion is inflammatory 





bowel disease. 

29. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising detecting an alteration in expression of a 

5 polypeptide of claim 3. w herein the presence of an alteration in expression of the 

polypeptide is indicative of the pathological condition or susceptibility to the pathological 
condition. 

30. The method of claim 29 wherein the alteration in expression is an increase in 
the amount of expression or a decrease in the amount of expression. 

10 31. The method of claim 29 wherein the pathological condition is inflammatory 

bowel disease. 

32. The method of claim 3 1 wherein the method further comprises the steps of: 
obtaining a first biological sample from a patient suspected of having inflammatory bowel 

i 

disease and obtaining a second sample from a suitable comparable control source; 
1 5 (a) determining the amount of at least one polypeptide encoded by a 

polynucleotide of claim 1 in the first and second sample; and 
(b) comparing the amount of the polypeptide in the first and second samples; 
wherein a patient is diagnosed as having inflammatory bowel disease if the amount 
of the polypeptide in the first sample is greater than or less than the amount of the 
20 polypeptide in the second sample. 

33. The use of the polynucleotide of claim 1 or polypeptide of claim 3 for the 
manufacture of a medicament for the treatment of inflammatory bowel disease. 

34. The use of the antibody of claim 18 for the manufacture of a medicament for 
the treatment of inflammatory bowel disease. 

25 35. A method for identifying a binding partner to the polypeptide of claim 3 

comprising: 

(a) contacting the polypeptide of claim 3 with a binding partner; and 

(b) determining whether the binding partner effects an activity of the 
polypeptide. 

30 36. The gene corresponding to the cDNA sequence of the isolated nucleic acid of 



claim 1. 



37. A method of identifying an activity of an expressed polypeptide in a 
biological assay, wherein the method comprises: 
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(a) expressing the polypeptide of claim 3 in a cell; 

(b) isolating the expressed polypeptide; 

(c) testing the expressed polypeptide for an activity in a biological assay; and 

(d) identifying the activity of the expressed polypeptide based on the test results. 
5 38. A substantially pure isolated DNA molecule suitable for use as a probe for 

genes regulated in inflammatory bowel disease, chosen from the group consisting of the 
DNA molecules identified in Table 1, having a 5' partial nucleotide sequence and length as 
described by their digital address, and having a characteristic regulation pattern in 
inflammatory bowel disease. 

10 39. A kit for detecting the presence of the polypeptide of the claim 3 in a 

mammalian tissue sample comprising a first antibody which immunoreacts with a 
mammalian protein encoded by a gene corresponding to the polynucleotide of claim 1 or 
with a polypeptide encoded by the polynucleotide of claim 3 in an amount sufficient for at 
least one assay and suitable packaging material. 

1 5 40. A kit of claim 39 further comprising a second antibody that binds to the first 

antibody. 

41 . The kit of claim 40 wherein the second antibody is labeled. 

42. The kit of claim 41 wherein the label comprises enzymes, radioisotopes, 
fluorescent compounds, colloidal metals, chemiluminescent compounds, phosphorescent 

20 compounds, or bioluminescent compounds. 

43. A kit for detecting the presence of a genes encoding an protein comprising a 
polynucleotide of claim 1, or fragment thereof having at least 10 contiguous bases, in an 
amount sufficient for at least one assay, and suitable packaging material. 

44. A method for detecting the presence of a nucleic acid encoding a protein in a 
25 mammalian tissue sample, comprising the steps of: 

(a) hybridizing a polynucleotide of claim 1 or fragment thereof having at least 10 
contiguous bases, with the nucleic acid of the sample; and 

(b) detecting the presence of the hybridization product. 

30 
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ABSTRACT of the disclosure 



The invention is directed to purified and isolated novel pol>peptides, the nucleic 
acids encoding such polypeptides, processes for production of recombinant forms of such 
5 polypeptides, antibodies generated against these polypeptides, fragmented peptides derived 
from these polypeptides, and the uses of the above. 
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